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(54) Tide: PROCESS FOR BACTERIAL PRODUCTTION OF POLYPEPTIDES 
(57) Abstract 

A process is provided for producing a 
heterologous polypeptide in bacteria. This 
process comprises, in a first step, cultur- 
ing bacterial cells that lack their native pstS 
gene and comprise nucleic acid encoding a 
PstS variant having an amino acid varia- 
tion within the phosphate-binding region of 
the corresponding native PstS^ nucleic acid 
encoding a DsbA or DsbC protein, nucleic 
acid encoding the heterologous polypeptide, 
a signal sequence for secretion of both the 
DsbA or DsbC protein and the heterologous 
polypeptide, an inducible promoter for the 
nucleic acid encoding the DsbA or DsbC 
protein, and an alkaline phosphatase pro- 
moter for the nucleic acid encoding the het- 
erologous polypeptide. The nucleic acid en- 
coding a PstS variant is under the transcrip- 
tional control of the wild-type pstS gene 
promoter. The culturing step is conducted 
under conditions whereby expression of the 
nucleic acid encoding the DsbA or DsbC 
protein is induced prior to induction of the 
expression of the nucleic acid encoding the 
heterologous polypeptide, whereby the cul- 
turing takes place in a culture medium at 
a concentration of inorganic phosphate in 
the medium that during all phases of cell 
growth is above the level at which the cells are starved for phosphate, and whereby either both the heterologous polypeptide and the DsbA 
or DsbC protein are secreted into the periplasm of the bacteria or the heterologous polypeptide is secreted into the medium in which the 
bacterial cells are cultured. The second step of the process involves recovering the heterologous polypeptide from the periplasm or the 
culture medium. 
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PROCESS FOR BACTERIAL PRODUCTION OF POLYPEPTIDES 
Background of the Invention 

Field of the Invention 

This invention relates to a process for producing and secreting polypeptides in bacteria by 
5 overproduction and secretion of a DsbA or DsbC protein. 
Description of Related Disclosures 

Much research has been conducted in the field of protein folding since the seminal publication of 
Anfmsen et a/., Proc. NatL Acad. Scl USA. 42: 1309-13 14 (1961) showing that, in vitro, reduced and denatured 
ribonuclease could refold into the active enzyme witii tt\e fonnation of suitable disulfide bonds. Later, a catalyst 
10 responsible for oxidative folding in eukaiyotes was discovered, called protein disulfide isomerase (PDI). 

Two types of proteins that assist in protein folding have been described: non-catalytic molecular 
chaperones that presumably prevent improper interactions leading to aggregation and events other than proper 
folding, and catalysts for two steps in protein folding, cis-trans prolyl isomerization and disulfide bond formation. 
While clear evidence for an in vivo requirement of prolyl isomerase activity is still lacking, the relatively recent 
1 5 isolation of mutants that are severely defective in disulfide bond formation has confumed that this latter folding 
step is catalyzed in vivo. 

PDI has been implicated in the catalysis of disulfide bond fonnation and rearrangement through in vitro 
data. Creighton et al., J. Mol. Biol.. 142; 43-62 (1980); Freedman et al„ Biochem. Soc. Svmp.. 5i: 167-192 
(1989). See also Bardwell and Beckwith,CglL 74: 769-771 (1993). In addition, yeast mutants in PDI that foil 

20 to form disulfide bonds in carboxypeptidase Y have been identified, indicating the importance of PDI in disulfide 
bond formation in vivo, LaMantia and Lennarz, 74: 899-908 (1993). 

In prokaryotes, mutations in a specific gene identified in the periplasm of E. coli by three independent 
investigating groups show a dramatic and pleiou-opic decrease in the rate of disulfide bond formation m secreted 
proteins. The protein encoded by this gene was variously designated as DsbA (disulfide bond) by Bardweil et 

25 a/.. Cell 62: 581-589 (1991) and Missiakas et aL, Proc. Natl. Acad. Sci. USA. 20: 7084-7088 (1993), and as 
PpfA (periplasmic phosphatase/protein formation) by Kamitani et al, EMBO J.. JJ.: 57-62 (1992). It is 
hereinafter referred to by the Dsb designation. The secreted proteins affected by this decrease in disulfide bond 
formation range &om proteins endogenous to E, coli such as alkaline phosphatase and OmpA to the recombinant 
mammalian proteins, urokinase and tissue plasminogen activator. It is now clear that the formation of disulfide 

30 bonds is a catalyzed process in both eukaryotes and prokaryotes. For a review on PDI and DsbA, see Bardweil 
and Beckwith, Cell, supra (1993). 

The DsbA protein has a highly reactive disulfide bond, similar to disulfide isomerase. Noiva and 
Lennarz, J. Biol. Chem. . 267 : 3553-3556 (1992); Bulleid, Adv. Prot.. 44: 125-150 (1993). Its crystal structure 
has recently been solved (Martin et al. Nature. 365 : 464-468 [1993]) and its redox potential has been 

35 determined. Wunderlich and Glockshuber, Prot. Sci.. 2: 71 7-726 (1 993). It is a significantly stronger oxidant 
than the cytoplasmic thioredoxin and it more closely resembles the eukaiyotic disulfide isomerases. It was found 
that unfolding DsbA stabilizes the reactive disulfide bond by about 1 8.9 kJmof^ Zapun et ai^ Biochemistrv. 
22: 5083-5092 (1993). This result is interpreted to indicate that the disulfide bond destabilizes the folded form 
of DsbA, thereby conferring its high energy content. 
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The gene of a second protein involved in disulfide bond fonnation, called dsbB^ was cloned by two 
independent groups. Bardwell et ai. Proc. Natl. Acad. Sci. USA. 90: 1038-1042 (1993) and Missiakase/ oil, 
1993, supra, DsbB is an integral membrane protein spanning the inner membrane. It appears to be involved in 
the re-oxidation of DsbA and thus may form part of a chain that links an- electron transfer step to the fonnation 
5 of disulfide bonds in the periplasm. Bardwell et al, supra (1993). A third gene for disulfide bond formation, 
dsbC, was recently identified Missiakas et fl/.. EMBOJ.. 13: 2013-2020 (1994); Shevchik et al, EMBO J.. 12: 
2007-2012 (1994). It encodes a 26-kDa periplasmic protein that can functionally substitute for DsbA. 

It has been shown that DsbA is required for the rapid fonnation of disulfide bonds of periplasmic £1 
colt proteins in vivo and in vitro. The same was found for the production of recombinant eukaryotic proteins in 

10 the periplasm of E, coii, e.g., different serine proteases (Bardwell et al., supra [1993]), antibody fragmmits 
(Knappik et al., Bio/Technolo^. H: 77-83 [1993]), and firagments of a T-cell receptor. The firaction of tiie 
recombinant molecules that becomes correctly folded can be smaller than in the case of natural periplasmic 
proteins, and is very sequence dependent. Overproduction of DsbA did not help to increase the proportion of 
correctly folded periplasmic antibody fragments (Knappik et al., supra), indicatmg that other steps limit their 

15 folding in the periplasm. Furthermore, folding of the 

a-amylase/trypsin inhibitor from Ragi in the periplasm was not improved when the dsbA gene was co-expressed 
and DsbA protein co-secreted without reduced glutathione present in the growth medium, but an increase in 
correctly folded secreted inhibitor was observed by co-expression of dsbA in conjunction with the addition of 
reduced glutathione to the medium. Wttnderlich and Glockshuber, J. Biol. Chem.. 2fii: 24547-24550 (1993). 

20 Further, WUnderlich and Glockshuber show no increase in total accumulation of the secreted a-amylase/trypsin 
inhibitor protein. 

WUlfing and Pltlckthun, Molecular Microbiology . 12: 685-692 (1994) produced functional fragments 
of a T-cell receptor (TCR) in the periplasm of E. coli by modest overproduction of DsbA and the £. co// heat- 
shock proteins at low temperature. The latter was achieved by over-expression of rpoH, which codes for the 

25 heat-shock sigma factor, sigma-^^. This increased the folding yield of the TCR fragments in the periplasm by 
about two orders of magnitude. It is not known what the yield is in the absence of rpoH overproduction. U.S. 
Pat. Nos. 5,270,181 and 5,292,646 disclose recombinant production of heterologous proteins by expression as 
a fusion protein with a thioredoxin-like protein (such as the thioredoxin-like domain of PDI) for high stability 
and solubility. JP 60-38771 published Februazy 15, 1994 discloses the expression of a human PDI gene linked 

30 to human serum albumin pre-pro sequence and co-expression of this linked gene and a foreign gene encoding 
a polypeptide. WO 93/25676 published December 23, 1993 discloses the production of disulfide-bonded 
recombinant proteins using a PDI, preferably a yeast PDI. EP 293,793 published December 7, 1988 discloses 
a polypeptide with PDI activity ensuring natural disulfide bridge arrangement in recombinant proteins. WO 
94/08012 published April 14, 1994 discloses increasing secretion of over-expressed gene products by co- 

35 expression of a che^erone protein such as a heat-shock protein or PDI. EP 509,841 published October 21, 1992 
discloses increased secretion of human serum albumin from yeast cells using a co-expression system involving 
PDI and a protein. U.S. Pat. No. 5,304,472 discloses use of bacterial PstS mutants in the production of 
heterologous polypeptides. 

There is a continuing need for increasing the total yield of proteins secreted by prokaryotes. 
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Thmfore, it is an object of this invention to provide a method for increasing polypeptide yield using 
an economically viable method involving physiologically regulating the intracellular environment for enhanced 
accumulation of foreign polypeptides in bacteria. 

This object and other objects will become apparent to those skilled in the art 

5 g^mmaTy of the fnv^tion 

Accordingly, in one aspect the present invention provides a process for producing a heterologous 
polypeptide in bacteria comprising: 

(a) culturing bacterial cells, which ceils comprise nucleic acid encoding a DsbA or DsbC protein, 
nucleic acid encoding the heterologous polypeptide, a signal sequence for secretion of both the DsbA or DsbC 

1 0 protein and the heterologous polypeptide, and an inducible promoter for both the nucleic acid encoding the DsbA 
or DsbC protein and the nucleic acid encoding the heterologous polypeptide, under conditions whereby 
expression of the nucleic acid encoding the DsbA or DsbC protein is induced prior to induction of the expression 
of the nucleic acid encoding the heterologous polypeptide, and under conditions whereby either both the 
heterologous polypeptide and the DsbA or DsbC protein are secreted into the periplasm of the bacteria or the 

15 heterologous polypeptide is secreted into the medium in which the bacterial cells are cultured; and 

(b) recovering the heterologous polypeptide from the periplasm or the culture medium. 

In another embodiment, the invention provides a process for producing a heterologous polypeptide in 
bacteria comprising: 

(a) culturing bacterial cells that lack their native psiS gene and comprise nucleic acid encoding a PstS 
20 variant having an amino acid variation within the phosphate-binding region of the corresponding native PstS^ 

nucleic acid encoding a DsbA or DsbC protein, nucleic acid encoding the heterologous polypeptide, a signal 
sequence for secretion of both the DsbA or DsbC protein and the heterologous polypeptide, an inducible 
promoter for the nucleic acid encoding die DsbA or DsbC protein, and an alkaline phosphatase promoter for the 
nucleic acid encoding the heterologous polypeptide, wherein the nucleic acid encoding a PstS variant is under 

25 the transcriptional control of the wild-type ps(S gene promoter, and wherein the culturing takes place under 
condhions whereby expression of the nucleic acid encoding the DsbA or DsbC protein is induced prior to 
induction of the expression of the nucleic acid encoding the heterologous polypeptide, under conditions whereby 
the culturing takes place in a culture medium at a concentration of inorganic phosphate in the medium that during 
all phases of cell growth is above the level at which the cells are starved for phosphate, and under conditions 

30 whereby eitiier both the heterologous polypeptide and the DsbA or DsbC protein are secreted into the periplasm 
of the bacteria or the heterologous polypeptide is secreted into the medium in which the bacterial cells are 
cultured; and 

(b) recovering the heterologous polypeptide from the periplasm or the culture medium. 

The over-expression and secretion of the bacterial protein DsbA or DsbC results in a significant increase 
35 in the amount of heterologous polypeptide produced in the periplasm of bacteria or in the culture medium. In 
the specific example shown below, the over-expression and secretion of the £. coli protein DsbA.resulted in a 
large increase in the total yield of human IGF-I deposited in inclusion bodies in the periplasmic space of £1 colL 
Furthermore, this total yield increase is accomplished by culturing in the absence of any glutathione in the 
medium and without co-expressing or over-expressing the E. coli heat-shock transcription factor, RpoH. 
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While DsbA has been suggested to be involved in the folding of peripiasmic and outer wall proteins 
of £ colij it was not known whether over-expression of the gene encoding this protein would have any effect, 
by itself, on secreted human proteins. It was also not known if over-expression of the dsbA gene would result 
in higher levels of DsbA-assoctated activity. If any effect were seen, it was expected that higher levels of DsbA 
S activity would promote the proper folding of IGF-I polypqitide and resuh in higher levels of folded, soluble IGF- 
1 monomer. In contrast, a significant increase was observed in total yield of IGF-I, due to an increase in Ae 
deposition of IGF-I polypeptide into insoluble aggregates, and the yield of soluble, properly folded protein 
decreased. Due to the fact that efficient methods exist for folding these insoluble aggregates into bioactive 
polypeptide, this increase in insoluble polypeptide is a very useful result. 
10 Further, significantly higher titers are obtained when a host with a psiS mutation and an active phoA 

gene is combined with DsbA overexpression for the production of heterologous polypeptide. 

Prirfp^smption of th? Pravying? 

Figure 1 depicts plasmids employed in the construction of pJJ42, used to produce IGF-I, namely 
pBKIGF-2B, pJJ41, and pKMtacII. 
15 Figure 2 depicts the construction of plasmid pJJ40, used to produce IGF-I, 

Figure 3 depicts the construction of plasmid pBKIGF2B-A, used to produce IGF-I. 
Figure 4 depicts the construction of plasmid pJJlOl, used to produce IGF-I. 
Figure S shows the time course of IGF-I polypeptide accumulation for five fermentation experiments 
entitled SI1312 (open circles), 811306 (diamonds), SI1329 (solid circles), SI1339 (diamonds), and SI1341 
20 (squares). Table 4 contains further details on each run. 

Pgtailgd Pg^grlptiftn pf the Prgferrg4 gmbQ(jiments 

A. Definitions 

As used herein, "dsbA" and "dsbC* refer to genes encoding the bacterial peripiasmic proteins known 
in the literature as DsbA or DsbC proteins, respectively. They may be from any bacterial source, with one 

25 example being the dsbA gene from £. coU as described by Bardwell et aL, supra (1991) and Kamitani et al, 
supra, and the dsbC gene from E. coli as described by Missiakas et aL, 1994, supra. The dsbA and dsbC genes 
and their products as used herein do not include PDI or any other eukaryotic protein that functions as a protein 
disulfide isomerase, including those disclosed in U.S. Pat. Nos. 5,270,181 and 5,292,646; JP 60-38771; WO 
93/25676; EP 293.793; WO 94/08012; and EP 509,84 1 , all supra. DsbA and DsbC differ in many respects j&om 

30 PDL For example, DsbA and DsbC retain their isomerase activity in the absence of redox buffer, whereas PDI 
requires a redox buffer for its isomerase activity. Further, DsbA and DsbC have molecular weights by sequence 
analysis of 21 kD and 25 kD, respectively, and contain two and four cysteine residues, respectively, whereas PDI 
has a molecular weight by sequence analysis of 57 kD and contains six cysteine residues. 

As used herein, "signal sequence" or "signal polypeptide" refers to a peptide that can be used to secrete 

35 the heterologous polypeptide into the periplasm or medium of the cultured bacteria or to secrete the DsbA or 
dsbC protein into the periplasm. The signals for the heterologous polypeptide may be homologous to the 
bacteria, or they may be heterologous, including signals native to the polypeptide being produced in tfie bacteria. 
For the dsbA and dsbQ the signal sequence is typically that which is endogenous to the bacterial cells, although 
it need not be as long as it is effective for its purpose. 
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An "over-expressed" gene product is one that is expressed at levels greater than normal endogenous 
expression for that gene product It can be accomplished, e.g., by introducing a recombinant construction that 
directs expression of a gene product in a host cell, or by altering basal levels of expression of an endogenous 
gene product, e.g., by inducing its transcription. 
5 • Hie promoters of this invention are "inducible" promoters, i.e., promoters which direct transcription 

at an increased or decreased rate upon binding of a transcription factor. 

"Transcription factors" as used herein include any foctors that can bind to a regulatory or control region 
of a promoter and thereby effect transcription. The synthesis or the promoter binding ability of a transcription 
fiactor within the host cell can be controlled by exposing the host to an "inducer" or removing an inducer from 
10 the host cell medium. Accordingly, to regulate expression of an inducible promoter, an inducer is added or 
removed from the growth medium of the host cell. As used herein, the phrase "induce expression" means to 
increase the amount of transcription from specific genes by exposure of the cells containing such genes to an 
effector or inducer. 

An "inducer" is a chemical or physical agent which, when given to a population of cells, will increase 

15 the amount of transcription from specific genes. These arc usually small molecules whose effects are specific 
to particular operons or groups of genes, and can include sugars, phosphate, alcohol, metal ions, hormones, heat, 
cold, and the like. For example, isopropylthio-p-galactoside (IPTG) and lactose are inducers of the tacll 
promoter, and L-arabinose is a suitable inducer of the arabinose promoter. The pho gene promoter, such as phoA 
and phoSy is inducible by low phosphate concentrations in the medium. 

20 As used herein, "polypeptide" or "polypeptide of interest" refers generally to peptides and proteins 

having more than about ten amino acids. Preferably, the polypeptides are "exogenous," meaning diat they are 
^lieteroiogous," f.e, foreign to the host cell being utilized, such as a human protein produced by a CHO cell, or 
a yeast polypeptide produced by a mammalian cell, or a human polypeptide produced firom a human cell line that 
is not the native source of the polypeptide. 

25 Examples of mammalian polypeptides include molecules such as, eg., renin, a growth hormone, 

including human growth hormone; bovine growth hormone; growth hormone releasing factor; parathyroid 
hormone; thyroid stimulating hormone; lipoproteins; a 1 -antitrypsin; insulin A-chain; insulin B-chain; proinsulin; 
thrombopoietin; follicle stimulating hormone; calcitonin; luteinizing hormone; glucagon; clotting factors such 
as factor VIIIC, factor EX, tissue fector, and von Willebrands factor; anti-clotting factors such as Protein C; atrial 

30 naturietic factor; lung surfactant; a plasminogen activator, such as urokinase or human urine or tissue-type 
plasminogen activator (t-PA); bombesin; thrombin; hemopoietic growth factor; tumor necrosis factor-alpha and 
-beta; enkephalinase; a serum albumin such as human serum albumin; mullerian-inhibiting substance; relaxin A- 
chain; relaxin B-chain; proreiaxin; mouse gonadotropin-associated peptide; a microbial protein, such as beta- 
lactamase; Dnase; inhibin; activin; vascular endothelial growth factor (VEGF); receptors for hormones or growth 

35 factors; integrin; protein A or D; rheumatoid factors; a neurotrophic factor such as brain-derived neurotrophic 
factor (BDNF), neurotrophin-3, -4, -5. or -6 (NT-3, NT-4, NT-5, or NT-6), or a nerve growth factor such as 
NGF-P; cardiotrophins (cardiac hypertrophy faaor) such as cardiotrophin-l (CT-l); platelet-derived growth 
factor (PDGF); fibroblast growth factor such as aFGF and bFGF; epidermal growth factor (EGF); transforming 
growth fector (TGF) such as TGF-alpha and TGF-beta, including TGF-p I . TGF-p2, TGF-p3, TGF-P4, or TGF- 
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p5; insulin-like growth factor-l and -II (IGF-I and IGF-II); des(l-3>IGF-I (brain IGF-I), insulin-like growdi 
factor binding proteins; CD proteins such as CD-3, CD-4, CD-8, and CD- 19; erytiiropoietin; osteoinductive 
factors; imniunotoxins; a bone morphogenetic protein (BMP); an interferon such as interferon-alpha, 
-beta, and -gamma; colony stimulating factors (CSFs), e.g., M-CSF. GM-CSF, and G-CSF; interleukins (ILs), 
5 eg., IL-1 to IL-10; anti-HER-2 antibody; superoxide dismutase; T-celi receptors; surface membrane proteinsf 
decay accelerating factor, viral antigen such as, for example, a portion of the AIDS envelope; transport proteins; 
homing receptors; addressins; regulatory proteins; antibodies; and fragments of any of the above-listed 
polypeptides. 

The preferred exogenous polypeptides of interest are mammalian polypeptides. Examples of such 
1 0 mammalian polypeptides include t-PA, gp 1 20, anti-HER-2, DNase, IGF-I, IGF-II, brain IGF-I, growth hormone, 
relaxin chains, growth hormone releasing factor, insulin chains or pro-insulin, urokinase, immunotoxins, 
neurotrophins, and antigens. Particularly preferred mammalian polypeptides include, e.g., t-PA, gpl20(IIIb), 
anti-HER-2, DNase, IGF-I, IGF-O, growth hormone, NGF, NT-3, NT-4, NT-5, and NT-6, more preferably IGF, 
most preferably IGF-1, and the most preferred mammalian polypeptide is a human polypeptide. 
15 As used herein, "IGF-I" refers to insulin-like growth factor from any species, including bovine, ovine, 

porcine, equine, and preferably human, in native sequence or in variant form and recombinantiy produced. In 
a preferred method, the IGF-1 is cloned and its DNA expressed, e.g.y by the process described in EP 128,733 
published December 19, 1984. 

The term *'PstS^ refers to the protein encoded by the pstS gene found in bacterial cells, especially 
20 Enterobacteriaceae cells, including K coli cells. This protein is known as the phosphate-binding protein of the 
bacterial cells and contains a phosphate-binding region. 

The "phosphate-binding region" is the region of the protein that binds to inorganic phosphate. This 
region includes the domain wherein hydrogen bonds form between the two molecules. In E, coli PstS this region 
is: 

25 Sidg Chains Pacl(t)one 

ThrlO ThrlO 

Ser38 Thr 141 

Asp 56 Ser38 

Arg 135 Gly 140 

30 Serl39 Phell 

Thr 141 

This region also includes other residues that indirectly affect phosphate binding, the associated conformational 
shift from open to closed complex when the phosphate is bound, and/or the associated signaling pathway. Thus, 
mutations in residues of PstS not directly contacting the phosphate (or proteins truncated by stop codons or 
35 frame-shifts) may have similar phenotypes to mutations in residues of PstS that bind directly to the phosphate. 

"PstS variants" are defined as molecules in which the amino acid sequence of the corresponding native 
(wild-type) PstS protein has been modified (by either a predetermined or random mutation) in the phosphate- 
binding region thereof, in such a way that the PstS protein no longer functions as a repressor at levels of 
phosphate greater than about 10 fimolar. Furlong, "Osmotic-Shock-Sensitive Transport Systems," in Neidhardt 
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FC cf al. (eds.) Escherichia coli and Salmonella tvphimurium: Cellular and Molecular Biology. Vol. 1 (American 
Society for Microbiology, Washington, DC, 1987) p. 768-796, particularly p. 772-773. Thus, the mutation 
reduces the affinity of the binding protein for phosphate. Amino acid sequence variants of PstS include, for 
example, deletions from, or insertions or substitutions of, residues within the PstS amino acid sequence shown 
5 in Figure 1 of U.S. 5,304,472. Any combination of deletion, insertion, and substitution may also be made to 
arrive at the final construct, provided that the final construct possesses the desired property of allowing 
polypeptide induction by the bacterial host cell at concentrations of phosphate in the medium that are above the 
starvation level. 

The phrase "hydrophobic residues" refers to the residues norleucine, cysteine, methionine, alanine, 
10 valine, leucine, tyrosine, phenylalanine, tryptophan, and isoleucine. 

The expression "control sequences" refers to DNA sequences necessary for the expression of an 
operably linked coding sequence in a particular host organism. The control sequences that are suitable for 
bacteria include a promoter such as the alkaline phosphatase promoter, optionally an operator sequence, and a 
ribosome-binding site. 

15 Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 

acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or 
enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 

20 "operably linked" means that the DNA sequences being linked are contiguous and, in the case of a secretory 
leader, contiguous and in reading phase. Linking is accomplished by ligation at convenient restriction sites. If 
such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with conventional 
practice. 

As used herein, the expressions "cell," "cell line," and "cell culture" are used interchangeably and all 
25 such designations include progeny. Thus, the words "transfomiants" and "transfomned cells" include the primary 
subject cell and cultures derived therefrom without regard for the number of transfers. It is also understood that 
all progeny may not be precisely identical in DNA content, due to delibemte or inadvertent mutations. Mutant 
progeny that have the same function or biological activity as screened for in the originally transformed cell are 
included. Where distinct designations are intended, it will be clear from the context. 
30 "Production phase of cell growth" refers to the period of time during cell growth following induction 

' of the promoter when the polypeptide of interest is being produced. 

"Partially induced" as it applies to the alkaline phosphatase promoter refers to a state in which complete 
induction of the alkaline phosphatase promoter is not achieved, but rather only partial induction thereof. In this 
way, the rate of transcription of the nucleic acid desired to be expressed is controlled. 
35 As used herein, "phosphate starvation level" or "level at which cells are starved for phosphate" refers 

to a concentration of inorganic phosphate (e.g., salts of phosphoric acid such as sodium phosphate, potassium 
phosphate, or phosphate associated with complex nitrogen sources such as casein hydrolysates or yeast extracts, 
etc) in the culture medium so low that the cells are considered deprived of the phosphate ions, leading to a 
reversible increase in the rate of protein degradation, an inhibition of RNA synthesis, a decrease in cell growth, 
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and a decrease in ATP. This is desaibed further in SL John and Goldberg, J, BacterioK. HJ: 1223-1233 (1980). 
This starvation level is to be distinguished fix)m the level of phosphate required for inductionA^ression of the 
phoA promoter. Complete starvation is not required to induce this promoter. pstS is tihought to be the cells' 
sensor of phosphate levels and thmfbre indirectly induces phoA expression. The desired inorganic phosphate 
S concentration for inducing polypeptide production will depend on such factors as the type of polypeptide being 
produced, the type of host cell, the type of medium, and the culture conditions employed. An exemplary 
concentration for this purpose is O.NIO pmolar. 

The technique of "polymerase chain reaction," or "PCR," as used herein generally refers to a procedure 
wherein minute amounts of a specific piece of nucleic acid, RNA, and/or DNA, are amplified as described in 

10 U.S. Pat No. 4,683,195 issued 28 July 1987. Generally, sequence information from the ends of the region of 
interest or beyond needs to be available, such that oligonucleotide primers can be designed; these primers will 
be identical or similar in sequence to opposite strands of the template to be amplified. The 5'-termmai 
nucleotides of the two primers may coincide with the ends of the amplified material. PCR can be used to amplify 
specific RNA sequences, specific DNA sequences from total genomic DNA, and cDNA transcribed from total 

1 5 cellular RNA, bacteriophage or plasmid sequences, etc. See generally Mullis et ai , Cold Soring Harbor Svmp. 
Quant. Biol. . 51: 263 (1987); Eriich, ed., PCR Technology. (Stockton Press, NY, 1989), For a recent review 
on PCR advances, see Eriich et ai. Science . 252 : 1643-1650 (1991). 

As used herein, PCR is considered to be one, but not the only, example of a nucleic acid polymerase 
reaction method for amplifying a nucleic acid test sample comprising the use of a known nucleic acid as a primer 

20 and a nucleic acid polymerase to amplify or generate a specific piece of nucleic acid. 
B. MQtfg^ fpr Carrying Out the jnygmipn 

In the process herein, expression of the dsbA or dsbC gene is induced just before (immediately prior 
to) heterologous gene expression. The heterologous polypeptide and DsbA or DsbC protein are both secreted 
into the periplasm or the heterologous pot3^eptide is secreted into the culture medium of the bacteria into which 

25 nucleic acid encoding these polypeptides has been introduced. Preferably, the polypeptide is recovered from 
the periplasm of the bacteria. 

The dsbA or dsbC nucleic acid may be cDNA or genomic DNA from any bacterial source, and is 
generally the native sequence. It is suitably separately placed from the nucleic acid encoding the heterologous 
polypeptide if the nucleic acids are on the satne vector, i.e., they are not linked. In addition, the nucleic acid 

30 encoding DsbA or DsbC and the nucleic acid encoding the heterologous polypeptide are under separate, 
differently inducible promoters so that induction of expression can occur in the required sequential order. The 
nucleic acid encoding the DsbA or DsbC and the nucleic acid encoding the heterologous polypeptide may be 
integrated into the host cell genome or contained on autonomously replicating plasmids. 

If the DsbA or DsbC are native products of the host cell, and if the factors controlling expression of 

35 these native genes are understood, such factors can be manipulated to achieve over-expression of the dsbA or 
dsbC genes, e.g., by induction of transcription from the natural promoter using known inducer molecules, by 
mutation of the nucleic acids controlling or repressing expression of the gene product to produce a mutant strain 
that inductively over-expresses the gene product, by second site mutations which depress the synthesis or 
function of factors that normally repress the transcription of the gene product, and the like. 
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In one alternative, the bacteria comprises two separate vectors respectively containing the nucleic acid 
encoding the DsbA or DsbC protein and the nucleic acid encoding the heterologous polypeptide. 

In another alternative, the nucleic acid encoding the DsbA or DsbC protein and the nucleic acid 
encoding the heterologous polypeptide are contained on the same vector but are under the control of separate 
5 inducible promoters and separate signal sequences. 

The heterologous nucleic acid (e.g, , cDNA or genomic DNA) is suitably inserted into a replicable vector 
for expression in the bacterium under the control of a suitable promoter for bacteria. Many vectors are available 
for this purpose, and selection of the appropriate vector will depend mainly on the size of the nucleic acid to be 
inserted into the vector and the particular host cell to be transformed with the vector. Each vector contains 

1 0 various components depending on its iiinction (amplification of DNA or expression of DNA) and the particular 
host cell with which it is compatible. The vector components for bacterial transforation generally include, but 
are not limited to, one or more of the following: a signal sequence, an origin of replication, one or more marker 
genes, and an inducible promoter. 

In general, plasmid vectors containing replicon and control sequences that are derived from species 

IS compatible with the host cell are used in connection with bacterial hosts. The vector ordinarily carries a 
replication site, as well as marking sequences that are capable of providing phenotypic selection in transformed 
cells. For example, E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species 
(see, Bolivar a/., 2'- 95 [1977]). pBR322 contains genes for ampicillin and tetracycline resistance 
and thus provides easy means for identifying transformed cells. The pBR322 plasmid, or other microbial plasmid 

20 or phage, also generally contains, or is modified to contain, promoters that can be used by the microbial organism 
for expression of the selectable marker genes. 

The DNA encoding the polypeptide of interest herein may be expressed not only directly, but also as 
a fusion with another polypeptide, preferably a signal sequence or other polypeptide having a specific cleavage 
site at the N-terminus of the mature polypeptide. In general, the signal sequence may be a component of the 

25 vector, or it may be a part of the polypeptide DNA that is inserted into the vector. The heterologous signal 
sequence selected should be one that is recognized and processed (i.e., cleaved by a signal peptidase) by the host 
cell. For bacterial host cells that do not recognize and process the native polypeptide signal sequence, the signal 
sequence is substituted by a bacterial signal sequence selected, for example, from the group consisting of the 
alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. 

30 Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 

in one or more selected host cells. Generally, in cloning vectors this sequence is one that enables the vector to 
replicate independently of the host chromosomal DNA, and includes origins of replication or autonomously 
replicating sequences. Such sequences are well known for a variety of bacteria. The origin of replication from 
the plasmid pBR322 is suitable for most Gram-negative bacteria. 

35 Expression and cloning vectors also generally contain a selection gene, also termed a selectable marker. 

This gene encodes a protein necessary for the survival or growth of transformed host cells grown in a selective 
culture medium. Host cells not transformed with the vector containing the selection gene will not survive in the 
culture medium. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, 
e.g., ampicillin, neomycin, metiiotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply 
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critical nutrients not available from complex media, e.g., the gene encoding D-aianine racemase for BacillL One 
example of a selection scheme utilizes a drug to arrest growth of a host cell. Those ceib that are successfully 
transformed with a hetmlogous gene produce a protein conferring drug resistance and thus survive the selection 
regimen. 

5 The expression vector for producing a heterologous polypeptide also contains an inducible promoter 

that is recognized by the host bacterial organism and is operably linked to the nucleic acid encoding the 
polypeptide of interest It also contains a separate inducible promoter operably linked to the nucleic acid 
encoding the DsbA or DsbC protein. Inducible promoters suitable for use with bacterial hosts include the P- 
lactamase and lactose promoter systems (Chang et a!.. Nature. 275 : 615 [1978]; Goeddel e/ a/. Nature. 281 : 544 

10 [1979]), the arabmose promoter system (Guzman ei ai, J. Bacteriol.- 174: 7716-7728 [1992]), alkaline 
phosphatase, a tryptophan (trp) promoter system (Goeddel, Nucleic Acids Res.. £: 4057 [1980] and EP 36,776) 
and hybrid promoters such as the tac promoter (deBoer et al.^ Proc. Natl. Acad. Sci. USA. 21-25 [1983]). 
However, other known bacterial inducible promoters are suitable. Their nucleotide sequences have been 
published, thereby enabling a skilled worker operably to ligate them to DNA encoding the polypeptide of interest 

15 or to the dsbA or dsbC genes (Siebenlist et al.^ Cell. 269 [1980]) using linkers or adaptors to supply any 
required restriction sites. 

Promoters for use in bacterial systems also generally contain a Shine-Dalgamo (S.D.) sequence operably 
linked to the DNA encoding the polypeptide of interest. The promoter can be removed from the bacterial source 
DNA by restriction enzyme digestion and inserted into the vector containing the desired DNA. If the host 

20 contains a pstS variant gene, the expression vector for producing a heterologous polypeptide contains an alkaline 
phosphatase promoter that is recognized by the host bacterial organism and is operably linked to the nucleic acid 
encoding the polypeptide of interest. This promoter initiates increased levels of transcription from DNA under 
its control in response to a decreased concentration of inorganic phosphate in the culture medium. The phoA 
promoter can be removed from the bacterial source DNA by restriction enzyme digestion and inserted into the 

25 vector containing the desired DNA. 

Construction of suitable vectors containing one or more of the above-listed components employs 
standard ligation techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and re-ligated in the 
form desired to generate the plasmids required. 

For analysis to confmn correct sequences in plasmids constructed, the ligation mixtures are used to 

30 transform K coli K12 strain 294 (ATCC 3 1 ,446) or other strains, and successful transformants are selected by 
ampicillin or tetracycline resistance where appropriate, Plasmids from the transformants are prepared, analyzed 
by restriction endonuclease digestion, and/or sequenced by the method of Sanger et al, Proc. Natl. Acad. Sci. 
USA. 24: 5463-5467 (1977) or Messing et ai. Nucleic Acids Res.. 2: 309 (1981) or by die method of Maxam 
etai. Methods in Enzvmologv. 65: 499 (1980). 

35 Suitable bacteria for this purpose include archaebacteria and eubacteria, especially eubacteria, and most 

preferably Enterobacteriaceae, Examples of useful bacteria include Escherichia^ Enterobacter^ Azotobacter, 
Envinia, Bacillus^ Pseudomonas, Klebsiella, Proteus, Salmomlia, Serratia, Shigella, Rhizobia, Vitreoscilla, and 
Paracoccus. Suitable £. coli hosts include E coli W3 1 10 (ATCC 27,325), E. coli 294 (ATCC 3 1 ,446), E, coli 
B,and£ co// XI 776 (ATCC 31,537). These examples are illustrative rather than limiting. Mutant cells of any 
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of the above-mentioned bacteria may also be employed. It is, of course, necessary to select the appropriate 
bactma taking into consideration replicability of the replicon in the cells of a bacterium. For example, £ co/i, 
Serratia^ or Salmonella species can be suitably used as the host when well known plasmids such as pBR322, 
pBR325, pACYC177, or pKN410 are used to supply the replicon. 
5 £1 coli strain W31 10 is a preferred host because it is a common host strain for recombinant DNA 

product fennentations. Preferably, the host cell should secrete minimal amounts of proteolytic enzymes. For 
example, strain W3110 may be modified to effect a genetic mutation in the genes encoding proteins, with 
examples of such hosts including E. coli W3110 strain 1A2, which has the complete genotype tonAA (also 
known as AfliuA)\ £ coli W3 1 10 strain 9E4, which has the complete genotype tonAA ptr3; £ coli W3 1 10 strain 

10 27C7 (ATCC 55,244), whidi has the complete genotype tonAA ptr3 phoAAElS A(argF4ac)l69 ompTA 
degP41kaif ; £ coli W31 10 strain 37D6, which has the complete genotype tonAA ptr3 phoAAElS A(argF' 
lac)169 ompTA degP4Jkat/ rbs7A ihG; £ coli W3110 strain 40B4, which is strain 37D6 with a non- 
kanamycin resistant €iegP deletion mutation; £ coli W3 1 10 strain 33D3, which has the complete genotype tonA 
ptr3 laclq LacL8 ompTdegP karf ; £ coli W31 10 strain 36F8, which has the complete genotype tonA phoA 

1 5 A fargF'lac) ptr3 degP karf^ HvG-^, and is temperature resistant at 37° C; and an £ coli strain having the mutant 
periplasmic protease(s) disclosed in U.S. Pat. No. 4,946,783 issued August 7, 1990. 

Preferably, the host in nature contains a native pstS gene. For the method herein that native gene is 
lacking and is replaced by the pstS variant gene, which is preerably homologous to die native pstS gene normally 
present in the host cells. Suitable bacteria for this purpose include eubacteria, especially Enterobacteriaceae, 

20 where the pstS gene is thus far found. Examples of bacteria belonging to Enterobacteriaceae include 
Escherichia, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, Serratia, and Shigella, One preferred £ 
co/i host is£ co// 294 (ATCC 3 1,446), although other strains such as £ coliB,E. coliXll76 (ATCC 31,537). 
and £ coli W3 110 (ATCC 27,325) are suitable. These examples are illustrative rather than limhing. Mutant 
cells of any of the above-mentioned bacteria may also be employed. It is, of course, necessary to select the 

25 appropriate bacteria taking into consideration replicability of the replicon in the cells of a bacterium. For 
example, £ coli, Serratia, or Salmonella species can be suitably used as the host when well known plasmids such 
as pBR322,pBR325,pACYA 177, or pKN4 10 are used to supply the replicon. £ coli strain W3110 is a 
particularly preferred parent host because it is a common host strain for recombinant DNA product femientations. 
Preferably, the host cell should secrete minimal amounts of proteolytic enzymes. For example, strain W3 1 10 

30 may be modified to effect a genetic mutation in the genes encoding proteases, with examples of such hosts 
including £ coli W3 1 10 strain 9E4, which has the complete genotype tonA ptr3, the preparation of which is 
described below, and £ coli W3l 10 strain 27C7, which has the complete genotype tonAA ptr3 phoAAElS 
A(argF'lac)l69 ompTA degP4lkaf/. Strain 27C7 was deposited on 30 October 1991 in the American Type 
Culture Collection as ATCC No. 55,244. Alternatively, the sd^in of £ coli having mutant periplasmic protease 

35 disclosed in U.S. Pat. No. 4,946,783 issued 7 August 1990 may be employed. 

Host cells are transfected. and preferably transformed with the above-described expression vectors of 
this invention and cultured in conventional nutrient media modified as appropriate for inducing the various 
promoters. 
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Transfection refers to the taking up of an expression vector by a host ceil whether or not any coding 
sequences are in fact expressed. Numerous methods of transfection are known to the ordinarily skilled artisan, 
for example, CaPO^ and electroporation. Successful transfection is generally recognized when any indication 
of the operation of this vector occurs wiAin the host cell. 
5 Transfonnation means introducing DNA into an organism so that the DNA is repiicable, either as an 

extrachromosomal element or by chromosomal integrant. Depending on the host cell used, transformation is 
done using standard techniques appropriate to such cells. The calcium treatment employing calcixmi chloride, 
as described in section 1.82 of Sambrook et aL, MQlP^^^lar ClPfling: A Ubomtorv Many^l [New Yoric: Cold 
Spring Harbor Laboratory Press, 1989], is generally used for bacterial cells that contain substantial cell-wall 
10 barriers. Another method for transformation employs polyethylene giycol/DMSO, as described in Chung and 
Miller, Nucleic Acids Res.. 16: 3580 (1988). Yet another method is the use of the technique termed 
electroporation. 

For purposes of this invention, a PstS variant contains one or more amino acid mutations within its 
phosphate-binding region, and is preferably from £. colL Such variants may be prepared by any means, e.g., 

15 recombinant, synthetic^ or partially synthetic. Amino acid sequence variants of PstS are suitably prepared by 
introducing appropriate nucleotide changes into the pstS DNA. Such variants include, for example, deletions 
from, or insertions or substitutions of, residues within the amino acid sequence shown for £. coli PstS in Figure 
I of U.S. Pat No. 5,304,472. Any combination of deletion, insertion, and substitution is made to arrive at the 
fmal construct, provided diat the final construct possesses the desired characteristics. The variants and their 

20 method of construction are described more fully in U.S. Pat. No. 5,304,472, the disclosure of which is 
incorporated by reference. . For the design of amino acid sequence variants of PstS, the preferred 
induction characteristics will depend on the location of the mutation site within the phosphate-binding region and 
the nature of the mutation. The sites for mutation can be modified individually or in series, e.g., by (1) 
substituting fct with conservative amino acid choices and then with more radical selections depending upon the 

25 results achieved, (2) deleting the target residue, or (3) inserting residues of the same or a different class adjacent 
to the located site, or combinations of options 1-3. 

While the site for introducing an amino acid sequence variation is predetermined, the nature of the 
mutation per se need not be predetermined. For example, to maximize the performance of a mutation at a given 
site, random mutagenesis is suitably conducted at the target codon or region and the expressed PstS variants are 

30 screened for the most favorable induction characteristics. 

Amino acid sequence deletions within the phosphate-binding domain of PstS generally will range from 
about 1 to 5 residues, and typically are contiguous. Contiguous deletions ordinarily are made in even numbers 
of residues, but single or odd numbers of deletions are within the scope hereof. 

Amino acid sequence insertions are intrasequence insertions of smgle or multiple amino acid residues 

35 within the phosphate-bmding domain, generally ranging from about 1 to 5 residues, most preferably 1 to 3. 
Insertions are preferably made in even numbers of residues, but this is not required. 

A third group of variants, which are preferred herein, are amino acid substitution variants. These 
variants have at least one amino acid residue widiin the phosphate-binding region of the native PstS molecule 
removed and a different residue inserted in its place. 
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Substantial modifications in phosphate-binding ability of the PstS protein are accomplished by selecting 
substitutions that differ significantly in their effect on maintaining (a) the structure of the PstS polypeptide 
backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site, or (c) the bulk of the iide-chain. Naturally occurring residues 
S are divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, cys, met, ala, val, leu, tyr, phe, tip, ile; 

(2) neutral hydrophilic: ser, thr, 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; and 

10 (5) residues that influence chain orientation: giy, pro. 

Preferably, the variants herein are those in which one or more of the amino acid residues positioned at 
the critical phosphate-binding region of the native counterpart protein is replaced with one or more different 
amino acids. For K colipstS variants, preferably residues thr, pro, or leu are substituted for the serine residue 
at position 139, residue his is substituted for the threonine residue at position 141, residues phe, leu, met, tyr, 

15 ala, cys, or gly are substituted for the threonine residue at position 10, and/or residues val, ala, leu, or ser are 
substituted for the asparagine residue at position 56 of native PstS. The most preferred £. coli PstS variants 
herein are those wherein a hydrophobic amino acid is substituted for the threonine residue at position 10, most 
preferably TIOM PstS and TlOY PstS, and the variants D56S PstS and T141H PstS, using die nomenclature 
indicated below. Such amino acid changes may also be combined to provide a variant molecule with more than 

20 one altered amino acid. 

Nucleic acid molecules encoding amino acid sequence variants of PstS are prepared by a variety of 
methods known in the art. These methods include, but are not limited to, preparation by oligonucleotide- 
mediated (or site-directed) mutagenesis, alanine-scanning mutagenesis, random mutagenesis, PCR mutagenesis, 
and cassette mutagenesis of an earlier prepared variant or a non-variant version of PstS, 

25 Oligonucleotide-mediated mutagenesis represents a preferred method for preparing substitution, 

deletion, and insertion variants of the pstS gene, although other methods may be utilized as desired. This 
technique is well known in the art as described by Zoller and Smith, Nucleic Acids Res. . )Q: 6487 (1982). 
Briefly, jnf/5DN A is altered by hybridizing an oligonucleotide encoding the desired mutation to a DNA template, 
where the template is the single-stranded form of a plasmid or bacteriophage containing the unaltered or native 

30 DNA sequence of pstS, After hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template that will thus incorporate the oligonucleotide primer, and will code for the 
selected alteration in the pstS DNA. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. A preferred oligonucleotide 
will have 12 to 15 nucleotides that are completely complementary to the template on either side of the 

35 nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will hybridize properly to die 
single-stranded DNA template molecule. The oligonucleotides are readily synthesized using techniques known 
in the art such as that described by Crea et ai, Proc. Natl. Acad. Sci. USA. 25: 5765 (1978). 

The DNA template can be generated by those vectors that are either derived from bacteriophage Ml 3 
vectors (the commercially available M13mpl8 and M13mpl9 vectors are suitable), or those vectors that contain 
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a single-stranded phage origin of replication as described by Viera et al., Meth. EnzvmoL. 153 : 3 (1987). Thus, 
the DNA that is to be mutated may be inserted into one of these vectors to generate single-stranded template. 
Production of the single-stranded template is described in Sections 4.2M.41 of Sambrook et al.^ Molecular 
Cloninp: A Laboratory Manual (Cold Spring Harbor Laboratory Press; NY 1989). 
S Alternatively, a single-stranded DNA template may be generated by denaturing double-stranded plasmid 

(or other) DNA using standard techniques. 

A useful method for identification of certain residues or regions of the PstS protein that are preferred 
locations for mutagenesis is called "alanine-scanning mutagenesis," as described by Cunningham and Wells, 
Science. 244 : 1 08 1 - 1 085 ( 1989). Here, a residue or group of target residues are identified (e.g. , charged residues 

10 such as arg, asp, his, iys, and glu) and replaced by a neutral or negatively charged amino acid (most preferably 
alanine or polyalanine) to affect the interaction of die amino acids with the surrounding aqueous environment 
in or outside the cell. Those domains demonstrating functional sensitivity to the substitutions then are refined 
by introducing further or other variants at or for the sites of substitution. Thus, while the site for introducing an 
amino acid sequence variation is predetermined, the nature of the mutation per se need not be predetermined. 

1 5 For example, to optimize the performance of a mutation at a given site, alanine scanning or random mutagenesis 
is conducted at the target codon or region and the expressed FstS variants are screened for the most preferred 
combination of desired activity. 

For alteration of the native DNA sequence (to generate amino acid sequence variants, for example), the 
preferred method is the combination of oligonucleotide-directed mutagenesis and random mutagenesis as 

20 described by Kunkel et al^ Methods Enzymol.. 154 : 367 (1987). In this method, oligonucleotide-directed 
mutagenesis is employed to randomize particular codons of the wild-type pstS gene to encode all possible 
residues. A pool of oligonucleotides with complementary sequence (about 10-15 bases) flanking the codon of 
choice is used. The codon of choice is replaced with the nucleotides NNS, where N is any nucleotide and S is 
G or C, to give a pool of oligonucleotides encoding all possible amino acids in 32 codons. 

25 In this preferred method, a pBR322-derived plasmid with a single-stranded origin of replication is 

prepared as a single-stranded plasmid template in an £1 colidut- ung- strain such as CJ236 (Kunkel et ai, supra). 
These two mutations in the strain cause the incorporation of one or more uracil nucleotides into the single- 
stranded DNA instead of thymine. The random oligonucleotides are annealed, filled in with £' coH phage T7 
DNA polymerase, ligated, and transformed into a wild-type strain of E, coli such as W31 10 or strain 13G8 

30 (W3 1 10 tonA^ PhoS64). The latter strain is pstS minus and derived from CGSC6777 (C75-b), which is derived 
from C75, described by Amemura et al, J. Bacter. . 152 : 692-701 (1982). The wild-type strain corrects the uracil 
misincorporation using the synthetic mutant strand as a template so as to produce about 90% mutants. 

DNA encoding PstS mutants with more than one amino acid to be substituted may be generated in one 
of several ways. If the amino acids are located close together in the polypeptide chain, they may be mutated 

35 sunultaneously using one oligonucleotide that codes for all of the desired amino acid substitutions. If, however, 
the amino acids are located some distance from each other (separated by more than about ten amino acids), it 
is more difficult to generate a single oligonucleotide that encodes all of the desired changes. Instead, one of two 
alternative methods may be employed. 
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In the first method, a separate oligonucleotide is generated for each amino acid to be substituted. The 
oligonucleotides are then annealed to the single-stranded templete DNA simultaneously, and the second strand 
of DNA that is synthesized from die template will encode all of the desired amino acid substitutions. The 
ahemative method involves two or more rounds of mutagenesis to produce the desired mutant. The first round 
S is as described for the single mutants: wild-type DNA is used for the template, an oligonucleotide encoding the 
first desired amino acid substitution(s) is annealed to this template, and the heteroduplex DNA molecule is ttien 
generated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of 
mutagenesis as the template. Thus, diis template already contains one or more mutations. The oligonucleotide 
encoding the additional desired amino acid substitution(s) is then annealed to this template, and the resuhing 

10 strand of DNA now encodes mutations from both the first and second rounds of mutagenesis. This resultant 
DNA can be used as a template in a third round of mutagenesis, and so on. 

PGR mutagenesis is also suitable for making amino acid variants of PstS. While the following 
discussion refers to DNA, it is understood that the technique also fmds application with RNA. The PGR 
technique generally refers to the following procedure (see Erlich, supra^ the chapter by R. Higuchi, p. 61-70): 

15 When small amounts of template DNA are used as starting material in a PGR, primers that differ slightly in 
sequence from the corresponding region in a template DNA can be used to generate relatively large quantities 
of a specific DNA fragment that differs from the template sequence only at die positions where the primers differ 
from the template. For introduction of a mutation into a plasmid DNA, one of the primers is designed to overiap 
the position of the mutation and to contain the mutation; the sequence of the other primer must be identical to 

20 a stretch of sequence of the opposite strand of the plasmid, but this sequence can be located anywhere along the 
plasmid DNA. It is preferred, however, that the sequence of the second primer is located within 200 nucleotides 
from that of the fbst, such that in the end the entire amplified region of DNA bounded by the primers can be 
easily sequenced. PGR amplification using a primer pair like the one just described results in a population of 
DNA fragments that differ at the position of the mutation specified by the primer, and possibly at other positions, 

25 as template copying is somewhat error-prone. 

If the ratio of template to product material is extremely low, the vast majority of product DNA 
fragments incorporate the desired mutation(s). This product material is used to replace the corresponding region 
in the plasmid that served as PGR template using standard DNA technology. Mutations at separate positions can 
be introduced simultaneously by either using a mutant second primer, or performing a second PGR with different 

30 mutant primers and ligating the two resulting PGR fragments simultaneously to the vector fragment in a three 
(or more)-part ligation. 

In a specific example of PGR mutagenesis, template plasmid DNA (1 ^g) is linearized by digestion with 
a restriction endonuclease that has a unique recognition site in the plasmid DNA outside of the region to be 
amplified. Of this material, 100 ng is added to a PGR mixture containing PGR buffer, which contains the four 
35 deoxynucleotide triphosphates and is included in the GeneAmp® kits (obtained from Perkin-Ebner Getus, 
Norwalk, GT and Emeryville, GA), and 25 pmole of each oligonucleotide primer, to a final volume of 50 ^1. 
The reaction mbcture is overlayed with 35 ^1 mineral oil. The reaction mixture is denatured for five minutes at 
100°G, placed briefly on ice, and then 1 ^1 Thermus aquaticus (Tag) DNA polymerase (5 units/fil, purchased 
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from Perkin-Elmer Cetus) is added below the mineral oil layer. The reaction mixture is then insated into a DNA 
thermal Cycler (purchased from Perkin-Eimer Cetus) progranmied as follows: 
2min.55"C 

30 sec. 72^C, then 19 cycles of the following: 
5 30sec.94«C 

30 sec. 55° C, and 
30 sec. 72** C. 

At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous phase 
transferred to a new vial, extracted with phenol/chloroform (50:50 vol), and ethanol precipitated, and die DNA 
10 is recovered by standard procedures. This material is subsequently subjected to the appropriate treatments for 
insertion into a vector. 

Another method for preparing variants, cassette mutagenesis, is based on the technique described by 
Wells et ai^ Gene, 3 1 5 (1985). The starting material is the plasmid (or other vector) comprising the pstS 
DNA to be mutated. The codon(s) in the pstS DNA to be mutated are identified. There must be a unique 

15 restriction endonuclease site on each side of the identified mutation site(s). If no such restriction sites exist, they 
may be generated using the above-described oligonucleotide-mediated mutagenesis method to introduce them 
at appropriate locations in the pstS DNA. After the restriction sites have been introduced into the plasmid, the 
plasmid is cut at these sites to linearize it A double-stranded oligonucleotide encoding the sequence of the DNA 
between the restriction sites but containing the desired mutation(s) is synthesized using standard procedures. The 

20 two strands are synthesized separately and then hybridized together using standard techniques. This 
double-stranded oligonucleotide is referred to as the cassette. This cassette is designed to have 3* and 5' ends 
that are compatible with the ends of the i 'mearized plasmid, such that it can be directly Hgated to the plasmid. 
This plasmid now contains the mutated pstS DNA sequence. 

Nucleic acid encoding the PstS variant may also be chemically synthesized and assembled by any of 

25 a number of techniques, prior to expression in a host ceil. [See, eg., Caruthers, U.S. Patent No. 4,500,707; 
Balland et aL, Biochimie. j^Z: 725-736 (1985); Edge et ai, Nature. 292 : 756-762 (1982)]. 

For purposes of shorthand designation of the PstS variants described herein, it is noted that numbers 
refer to the amino acid residue/position along the amino acid sequences of the mature PstS protein. Amino acid 
identification uses the single-letter alphabet of amino acids, 



30 


Asp 


D 


Aspartic acid 


He 


I 


Isoleucine 




Thr 


T 


Threonine 


Leu 


L 


Leucine 




Ser 


S 


Serine 


Tyr 


Y 


Tyrosine 




Glu 


E 


Glutamic acid 


Phe 


F 


Phenylalanine 




Pro 


P 


Proline 


His 


H 


Histidine 


35 


Gly 


G 


Glycine 


Lys 


K 


Lysine 




Ala 


A 


Alanine 


Aig 


R 


Arginine 




Cys 


C 


Cysteine 


Trp 


W 


Tryptophan 




Val 


V 


Valine 


Gin 


Q 


Glutamine 




Met 


M 


Methionine 


Asn 


N 


Asparagine 
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The designation for a substitution variant herein consists of a letter followed by a number followed by 
a letter. The first (leftmost) letter designates the amino acid in the wild-type, mature PstS protein. The number 
refers to the amino acid position where the amino acid substitution is being made, and the second (right-hand) 
letter designates the amino acid that is used to replace the wild-type aminb acid. The designation for an insertion 
5 variant consists of the letter i followed by a number designating the position of the residue in wild-type, mature 
PstS protein before which the insertion starts, followed by one or more capital letters indicating, inclusively, the 
insertion to be made. The designation for a deletion variant consists of the letter d followed by the number of 
the start position of the deletion to the number of the end position of the deletion, with the positions being based 
on the wild-type, mature PstS protein. Multiple mutations are separated by a comma in the notation for ease of 
10 reading them. 

Examples of the nomenclature for £. coii PstS protein are as follows: a substitution variant where &e 
threonine at position 10 of the wild-type PstS protein is replaced with a methionine residue is designated TIOM 
PstS, A substitution variant with multiple substitutions M and S at positions 10 and 56 of the wild-type PstS 
protein is designated T10M,D56S PstS. An insertion variant where cysteine and valine are inserted after the 

15 threonine at position 10 of wild-type PstS is designated iTl OCV PstS. A deletion variant where the amino acid 
(threonine) at position 10 is deleted from the wild-type, mature PstS is designated dTlO PstS. As indicated in 
the above examples, the notation ^PstS follows after each mutant. 

Most deletions and insertions, and substitutions in particular, are not expected to produce radical 
changes in the characteristics of the PstS molecule. However, when it is difficuh to predict the exact effect of 

20 the substitution, deletion, or insertion in advance of doing so, one skilled in the art appreciates that die effect may 
be evaluated by routine screening assays. 

A DNA variant typically may be made by random and/or site-specific mutagenesis of the native PstS- 
encoding nucleic acid and transfection or integration of the pstS variant gene into the chromosomes of a bacterial 
host, or by random mutagenesis of a host containing the native pstS gene. The nucleic acid variant may then be 

25 screened in a suitable screening assay for the desired characteristic. 

For example, in one embodiment, the mutant genes are screened for alkaline phosphatase activity in high 
phosphate (about 3 mM phosphate) by transforming the mutant pool into a pstS- W31 10 strain such as strain 
13G8 described above or C75 (Amemura et ai., supra) and plating on LB-BCIP-carbenicillin plates. Plasmids 
are isolated from blue colonies and the pstS gene is then sequenced to determine specific mutations. 

30 Alternatively, single transformants are grown overnight in duplicate 96-well plates containing either 

high (2-5 mM) or low (0.3 mM) phosphate. Then aliquots are assayed for PhoA activity by PNPP hydrolysis. 
Mutants with increased activity, particular those with higher activity in high phosphate concentration, are 
sequenced and further characterized for activity. 

Sequenced mutants selected from either screen are further characterized by growing cells in 96-well 

35 plates containing a range of phosphate concentrations from 0.02 to 50 mM and assaying aliquots for PhoA 
activity. The mutants selected from this screening are integrated into the chromosomal pstS locus to replace the 
wild-type gene and to be under the transcriptional control of the wild-type PstS promoter. The integrated strains 
are characterized by assaying PhoA levels. 
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Those strains meeting this last test can then be evaluated in shake-flask cultures with varying initial 
phosphate concentrations to assay for the expression of either homologous or heterologous protein products as 
appropriate. In addition or alternatively, the new organisms can be evaluated in high-cell-density fermentations 
where different phosphate feeding rates are employed after the initially dtarged phosphate is depl^ed. As a final 
5 test and optimization, these strains can then be used in a fermentor where phosphate levels can be monitored and 
regulated on-line. The cells are transformed with the nucleic acid encoding this polypeptide and the cells will 
be assayed for levels of that polypeptide produced under transcriptional control of the specific promoter 
employed, e.g., the phoA promoter. 

In die latter application where the phoA promoter is used, the phosphate level of the medium in which 

10 the altered bacterial host cells are cultured is measured on line (i.e., by continuous sampling), starting with an 
excess of phosphate (40 mM) in the medium. Then the phosphate level is depleted to about 0.2 to 5 mM 
phosphate, and the rate of induction of the PhoA promoter is measured by techniques known to those skilled in 
the art. The preferred PstS mutants are those wherein the polypeptide induction at this concentration of 
phosphate increases the ultimate yield of polypeptide or increases the relative amount of intact polypeptide or 

IS the cell density. 

If the nucleic acid encoding the PstS variant is produced outside the bacterial host cell that will 
ultimately produce the polypeptide of interest, the nucleic acid is introduced into the appropriate bacterial cell 
using any suitable method, including transfection and transformation by a vector encoding the PstS variant and, 
preferably, integration into the chromosome of the bacterial cells by any suitable method known in the art. An 

20 example of insertion of the pstS gene into the host genome includes that using the £ coli species as host. In this 
case included in the vector for transformation is a DNA sequence that is complementary to a sequence found in 
E, coli genomic DNA. Transfection oiE. coli with this vector results in homologous recombination with the 
genome and insertion of the pstS variant gene. The host for this purpose is either pstS minus or has its wild-type 
pstS gene replaced by the pstS variant gene upon integration thereof 

25 Bacterial cells used to produce the polypeptide of interest of this invention are cultured in suitable media 

in which the alkaline phosphatase promoter can be partially or completely induced as described generally, e.g., 
in Sambrook et al.^ supra. The culturing need never take place in the absence of inorganic phosphate or at 
phosphate starvation levels. At fust, the medium contains inorganic phosphate in an amount above the level of 
induction of protein synthesis and sufficient for the growth of the bacterium. As the cells grow and utilize 

30 phosphate, they decrease the level of phosphate in the medium, thereby causing induction of synthesis of the 
polypeptide. 

Any other necessary supplements besides carbon, nitrogen, and inorganic phosphate sources may also 
be included at appropriate concentrations introduced alone or as a mixmre with another supplement or medium 
such as a complex nitrogen source. The pH of the medium may be any pH from about 5-9, depending mainly 
35 on the host organism. Preferably, the medium contains no reduced glutathione, arid the bacteria are not cultured 
so as to over-express nucleic acid encoding the heat-shock transcription factor, RpoH. 

For induction, typically the cells are cultured until a certain optical density is achieved, e.g., a A^^q of 
about 60-80, at which point induction is initiated (e.g., by addition of an inducer, by depletion of a medium 
component, etc.), to induce expression of the dsbA or dsbC gene. When the optical density reaches a higher 
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amount, e.g., a A^^q of about 80-100, induction of the alkaline phosphatase promoter for the heterologous 
polypeptide is effected. 

Gene expression may be measured in a sample directly, for example, by conventional northern blotting 
to quantitate the transoiption of mRNA. Thomas, Proc. Natl. Acad. Sci. USA. 77: 5201-5205 (1980). Various 

on 

5 labels may be employed, most commonly radioisotopes, particularly ^^P. However, other techniques may also 
be employed, such as using biotin-modified nucleotides for introduction into a polynucleotide. The biotin then 
serves as the site for binding to avidin or antibodies, which may be labeled with a wide variety of labels, sudi 
as radionuclides, fluorescers, enzymes, or the like. 

Procedures for observing whether an expressed or over-expressed gene product is secreted are readily 

10 available to the skilled practitioner. Once the culture medium is separated from the host cells, for example, by 
centrifiigation or filtration, the gene product can then be detected in the cell-free culture medium by taking 
advantage of known properties characteristic of the gene product Such properties can include the distinct 
unmunological, enzymatic, or physical properties of the gene product. 

For example, if an over-expressed gene product has a unique enzyme activity, an assay for that activity 

1 5 can be performed on the culture medium used by the host cells. Moreover, when antibodies reactive against a 
given gene product are available, such antibodies can be used to detect the gene product in any known 
immunological assay (e.g., as in Harlowe et ai. Antibodies: A Laboratory Manual. Cold Spring Harbor 
Laboratory Press, New Yoric, 1988). 

The secreted gene product can also be detected using tests that distinguish polypeptides on the basts 

20 of characteristic physical properties such as molecular weight. To detect the physical properties of the gene 
product, all polypeptides newly synthesized by the host cell can be labeled, e.g., with a radioisotope. Conmion 
radioisotopes that can be used to label polypeptides synthesized within a host cell include tritium (^H), carbon- 14 
(^"^C), sulfur-35 (^^S), and the like. For example, the host cell can be grown in "^^S-methionine or ^^S-cysteine 
medium, and a significant amount of the ^^S label will be preferentially incorporated into any newly synthesized 

25 polypeptide, including the over-expressed heterologous polypeptide. The ^^S-containing culture medium is then 
removed and the cells are washed and placed in fresh non-radioactive culture medium. After the cells are 
maintained in the fresh medium for a time and under conditions sufQcient to allow secretion of the ^^S- 
radiolabeled expressed heterologous polypeptide, the culture medium is collected and separated from the host 
cells. The molecular weight of the secreted, labeled polypeptide in the culture medium can then be determined 

30 by known procedures, e.g., polyacrytamide gel electrophoresis. Such procedures, and/or other procedures for 
detecting secreted gene products, are provided in Goeddel, D.V. (ed.) 1990, Gene Expression Technology, 
Methods in Enzvmologv. Vol. 1 85 (Academic Press), and Sambrook et aL, supra. 

For secretion of an expressed or over-expressed gene product, the host cell is cultured under conditions 
sufficient for secretion of die gene product Such conditions include, e.g., temperature, nutrient, and cell density 

35 conditions that permit secretion by the cell. Moreover, such conditions are those under which the cell can 
perform basic cellular functions of transcription, translation, and passage of proteins from one cellular 
compartment to another, as are known to those skilled in the art. 

The polypeptide of interest is recovered from the periplasm or culture medium as a secreted 
polypeptide. It is often preferred to purify the polypeptide of interest from recombinant cell proteins or 
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polypeptides and from the DsbA or DsbC protein to obtain preparations that are substantially homogeneous as 
to the polypeptide of interest As a first stq), the culture medium or tysate is centrifiiged to remove particulate 
cell debris. The membrane and soluble protein fractions may then be separated if necessary. The polypeptide 
may then be purified from the soluble protein fraction and from the membrane fraction of the culture lysate» 
5 depending on whedier the polypeptide is membrane bound, is soluble, or is present in an aggregated form. The 
polypeptide thereafter is solubilized and refolded, if necessary, and is purified fitim contaminant soluble proteins 
and polypeptides. In a preferred embodiment, the aggregated polypeptide is isolated, followed by a simultaneous 
solubilization and refolding step, as disclosed in U.S. Pat No. 5,288,931. 

Another preferred method for isolating exogenous polypeptides from a complex biological mixture 

10 containing polypeptides and non-polypeptides contained in a fermentation broth, such as the one described 
above, involves contact of reagents with the cells, preferably the cell culture, containing the polypq}tide in a non- 
native conformation, so that an aqueous extraction/isolation can take place. Preferably, the method entails direct 
addition of reagents to the fermentation vessel after the polypeptide has been produced recombinant^, thereby 
avoiding extra steps of harvesting, homogenization, and centrifugation to obtain the polypeptide. While the 

1 5 remaining particulates can be removed by Gaulin homogenization and re-suspension, filtration, or a combination 
thereof, this method utilizes a multiple-phase extraction system for purifying recombinant polypeptides from the 
remaining particulates. 

In particular, this method is preferred for non-native mammalian polypeptides produced recombinantiy 
in bacterial cells, including £ co//, which form refractile bodies in the periplasm of the cells. In this system, one 
20 or more denaturants (chaotropic agent), such as urea, guanidine hydrochloride, and/or a base, and a reducing 
agent, such as dithiothreitol or cysteine, are added to the polypeptide<ontaining medium and then phase-foiming 

species are added to the broth. Once tiiis second group of reagents is added to the broth, multiple phases are 
formed whereby one phase is enriched in the polypeptide and depleted in biomass solids and nucleic acids. 
Preferably, the system has two to four phases, and more preferably two phases, one being enriched in polypeptide 
25 and the other being enriched in biomass solids and nucleic acids. Preferably, the desired polypeptide partitions 
to the upper phase so that the upper phase is enriched in the polypeptide and depleted in the biomass solids and 
nucleic acids. 

Specifically, after fermentation is complete, the ceil culture is contacted with one or more chaotropic 
agents, an optional reducing agent, and phase-forming reagents so that multiple phases are formed, one phase 
30 of which is enriched in the polypeptide of interest. It is preferred to add the chaotrope and reducing agent first 
to extract the polypeptide from the cell and maintain its solubility in the broth before the phase-forming reagents 
are added. Also, while the polypeptide of interest can be extracted from (and enriched in) any phase, preferably 
it is recovered from the uppermost phase. 

Most preferably, the chaotropic agent and optional reducing agent are added directly to the fermentation 
35 broth in the fermentation vessel before isolation of the polypeptide so that the reagents permeate the cells and 
the polypeptide is solubilized and diffuses to the surrounding medium. The reducing agent is added if the 
polypeptide contains at least one sulfhydryl group. 

Examples of suitable reducing agents include dithiothreitol (DTT), p-mercaptoethanol (BME), cysteine, 
thioglycolate, and sodium borohydride. The amount of reducing agent to be present in the buffer will depend 
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mainly on the type of reducing agent and chaotropic agent, the type and pH of the buffer employed, and the type 
and concentration of the polypeptide in the buffer. An effective amount of reducing agent is that which is 
sufRcient to eliminate intennolecular disulfide-mediated aggregation. For example, with 0.5-6 mg/mL IGF-I in 
a buffered solution at pH 7.5-10.5 containing M M urea, the DTT concentration is at about 1-20 mM, and the 
5 concentration of cysteine is at about 10-50 mM. The preferred reducing agent is DTT at about 2-10 mM or 
cysteine at about 30-50 mM. 

Chaotropic agents suitable for practicing this method of extraction include, e.g., urea and salts of 
guanidine or thiocyanate, more preferably urea, guanidine hydrochloride, or sodium thiocyanate. The amount 
of chaotropic agent necessary to be present in the buffer depends, for example, on the type of chaotropic agent 

10 and polypeptide present The amount of chaotropic agent to be added to the fermentation broth will be 
sufHciently high to extract the polypeptide from the cell and maintain its solubility in the broth. If the 
polypeptide is to be extracted from the top phase, the amount of chaotropic agent must be sufficiently low so tfiat 
after addition of the phase-forming species, the density is not increased to a point where the solids rise to the top 
instead of settling to the bottom. Generally the concentration of chaotropic agent is about 0. 1 to 9 M, preferably 

15 about 0.5-9 M, more preferably about 0.5 to 6 M, and most preferably about 0.5-3 M. Also, preferably the 
chaotropic agent is added to the culture medium before the phase- forming reagents are added. The preferred 
chaotropic agent herein is urea at about 1 .5-2.5 M, more preferably at about 2 M, or guanidine hydrochloride 
at about 0.5-3 M. Most preferably, the chaotropic agent is urea. 

The concentration of the polypeptide in the aqueous solution to which the chaotrope and reducing agent 

20 are added must be such that the polypeptide will be recovered in the maximum yield. The exact amount to 
employ will depend, e.g., on the type of polypeptide and the concentrations and types of other ingredients in the 
aqueous solution, particularly the reducing agent, chaotropic agent, phase-forming species, and pH. For 
polypeptides in general, the preferred concentration of polypeptide is about 0.1 to 15 mg/mL. The preferred 
concentration of IGF-I (resulting in the maximum yield of denatured or non-native IGF-I) is in the range of 0.5-6 

25 mg per mL, more preferably 1 .5-5 mg/mL. 

The types of phase-forming species to employ herein depend on many factors, including the type of 
polypeptide and the ingredients in the fermentation broth being treated. The species must be selected so that the 
polypeptide does not precipitate and one phase is more hydrophobic than the other phase so that the polypeptide 
will be located in the more hydrophobic phase and the biomass solids and nucleic acids will settle to the less 

30 hydrophobic phase. 

The phase-forming species may be a combination of agents, including polymer combinations (polymer- 
polymer), polymer-salt combinations, solvent-salt, and polymer-solvent combinations. Suitable polymers are 
both highly hydrophilic polymers and less hydrophilic polymers, i.e., any phase-forming polymers that are known 
in the art Examples include polyethylene glycol or derivatives thereof, inchiding various molecular weights of 

35 PEG such as PEG 4000, PEG 6000, and PEG SOOO, derivatives of PEG described, for example, in Grunfeld et 
ai, A ppl. Biochem. Biotechnol.. 22: 11 7- 138 (1992), polyvinylpyrrolidone (PVP), in a preferable molecular 
weight range of about 36,000 to 360,000, starches such as dextran (eg., dextran 70 and 500), dextrins, and 
mahodextrins (preferable molecular weight between about 600 and 5,000), sucrose, and Ficoll-400™ polymer 
(a copolymer of sucrose and epichlorohydrin). The preferred polymer herein is polyethylene glycol. 
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polypropylene glycol, polyvinyipynolidone, or a polysaccharide such as a dextran. The most preferred polymer 
herein is PEG of different molecular weights or a PEG-polypropylene glycol combination or copolymer. 

Examples of suitable organic solvents include ethylene glycol, glycerol, dimethyl sulfoxide, 
polyvinylaicohol, dimethylformamide, dioxane, and alcohols such as methanol, ethanol, and 2-propanol. Such 
5 solvents are such that, when added to aqueous solution, they increase the hydrophobicity of the solution. 

The salts can be inorganic or organic and preferably do not act to precipitate the polypeptide. Salts 
containing transition elements are not prefen^ed as they tend to precipitate the polypeptide. Anions are selected 
that have the potential for forming aqueous multiple-phase systems. Examples include ammonium sulfate, 
sodium dibasic phosphate, sodium sulfate, ammonium phosphate, potassium citrate, magnesium phosphate, 

10 sodium phosphate, calcium phosphate, potassium phosphate, potassium sulfate, magnesium sulfate, calcium 
sul&te, sodium citrate, manganese sulfate, manganese phosphate, etc. l^ypes of salts that are useful in forming 
bi-phasic aqueous systems are evaluated more fully in Zaslavskii et al, J. Chrom.. 267-281 (1988). 
Preferred salts herein are sulfates, phosphates, or citrates and are alkali or alkaline earth metals. More preferred 
are sulfates and citrates, and most preferred are sulfates since there are fewer pH limitations with sulfates. The 

15 most preferred salts herein are sodium sulfate and sodium citrate. 

The amounts of phase-forming species to add to the polypeptide of interest to obtain a satisfactory 
multiple-phase system are those known in the art. The amount of phase-forming species added to the polypeptide 
will depend on such fiactors as, for example, the amount of chaotropic agent and reducing agent, if any, already 
present in the fermentation broth, the nature of the cell culture media, the type of celb used in the fermentation, 

20 the type of polypeptide being treated, whether the polypeptide will be recovered from the lower or upper phase, 
and the type(s) of phase-forming species being added. The general concentration of polymer employed is about 
5% (w/w) up to the limit of solubility for the polymer and the concentration of salt employed is about 3% (w/w) 
up to the limit of solubility for the salt, depending on the size of the phase- volume ratio needed. The phase- 
volume ratio must be sufficient to accommodate the biomass solids. The types and amounts of phase-forming 

25 species that are effective can be determined by phase diagrams and by evaluating the fmal result, Le. , the degree 
of purity and the yield of the polypeptide of interest. If the phase-forming species are a polymer-salt 
combination, preferably the concentration of salt added is about 4- 1 5% (w/w) and the concentration of polymer 
is 5-18% (w/w) so that the desired polypeptide will be in an opposite phase from that in which the biomass solids 
and nucleic acids are present. . 

30 If the system desired is one where the polypeptide is distributed m the top phase and the biomass solids 

and nucleic acids are in the bottom phase, then there is a window of concentrations of phase-forming species. 
When higher amounts of chaotropic agent are added to maintain solubilization, the higher the amount of phase- 
fomiing species required. However, a high concentration of all these reagents will increase the density of the 
soludon. A high density will cause die biomass solids to settle less readily. An overly high density will cause 

35 biomass solids to float on the surface. Hence, the concentrations of chaotropic agent and phase-forming species 
must be sufficiently high to maintain a fiilly solubilized polypeptide, but low enough to allow the biomass solids 
to sediment to the opposite (lower) phase. 

If the polypeptide is to be recovered in the upper phase, typically the salt concentration will be about 
4-7% (w/w) and the polymer concentration will be about 12-18% (w/w), depending, &g., on the type of salt, 
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polymer, and polypeptide. If an organic solvent is added as a phase-forming species, such as ethanol, it is 
prefmbly added in a concentration of about 10 to 30% (vohune/volume) of the solution, depending, eg., on the 
type of polypeptide and alcohol and if any other phase-fonning species is present, preferably at a concentration 
of about 20% (v/v). 

S The exact conditions for contacting the cell culture with the various reagents will depend on, e.g., the 

pH of the buffer, the types of phase-forming reagents, and the types and concentrations of polypeptide and 
diaotropic and reducing agents. The reaction temperature is generally about 20-40** C, more preferably room 
temperature. The contacting step will generally be carried out for at least about 30 minutes, preferably about 30 
minutes to 12 hours depending on whether side-reactions will occur, more preferably about 30 minutes to 8 
10 hours, and most preferably about 30 minutes to 1 .5 hours. 

If the polypeptide is being unfolded, the degree of unfolding is suitably determined by chromatography 
of the non-native polypeptide, including hydrophobic interaction chromatography or ion-exchange 
chromatography. Increasing peak area for the non-native material indicates how much non-native polypeptide 
is present. 

15 Once the multiple-phase system is established, one phase will be enriched in the polypeptide and 

depleted in the disrupted particles and cells comprising the biomass solids and nucleic acids. In a two-phase 
system, preferably the top phase is enriched in the polypeptide whereas the bottom phase is enriched in the 
disrupted particles and cells. The polypeptide can be easily recovered by separation of the phases. This recovery 
step may be accomplished by decanting the upper phase, by draining the lower phase, or by centrifugatiorL The 

20 polypeptide can then be isolated from die phase in which it is contained by changing the pH of the phase so as 
to precipitate the polypeptide or by adding a suitable solvent, whereupon the precipitated polypeptide is suitably 
recovered by centrifugation or filtration or as a slurry. Alternatively, the polypeptide can be recovered from the 
polymer-containing phase by re-extraction by addition of a suitable polymer, salt, or solvent. In the case of IGF- 
I, the polypeptide is recovered from the isolated polymer phase by lowering the pH so that the IGF-I will 

25 precipitate, resulting in a yield of IGF-I of as much as or more than about 97%. The DsbA or DsbC protein 
would be separated from the polypeptide at this stage. 

Once obtained from the liquid phase of die muhiple-phase system, or at a later stage of purification, the 
polypeptide is suitably refolded into an active conformation. One suitable refolding method diat can be utilized 
is that which follows. 

30 After the polypeptide is solubilized and exu'acted by the multiple-phase extraction system herein, it is 

placed or diluted into a buffer containing solvent, chaotropic agent, salt, and a minimal amount of a copper or 
manganese salt. This buffer unexpectedly increases refolding yields of polypeptide from any type of host This 
buffer is at a pH of about 7 to 12, depending mainly on the type of polypeptide and reducing agent, preferably 
about 8 to 1 1, more preferably pH 8.5 to 1 1, and most preferably 8.5 to 10.5. 

35 One key ingredient of the buffer is an alcoholic or polar aprotic solvent at a concentration of about 5- 

40% (v/v), preferably 10 to 30% (volume/volume) of the solution, depending, on the type of polypeptide 
and solvent, and the concentration of chaotropic agent. It is most preferably at a concentration of about 20% 
(v/v). 
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A second key ingredient to this buffer is an alkaline eanfa, alkali metal, or ammonium salt, which is 
present in a concentration of about 0.2 to 3 M, preferably 0.2 to 2 M, depending mainly on the chaotrope 
concentration, solvent concentration, and the type of alkaline earth, alkali metal, or anunonium salt and 
polypeptide employed. For example, if the cation is sodium, potassium, or ammonium, the concoitration is about 
S 0.5 to 3 M, but if the cation is magnesium, the concentration is about 0.2 to 1 M. 

A third key ingredient of the buffer is an effective amount of a chaotropic agent The amount of such 
chaotrope will depend mainly on the concentration of alkaline earth, alkali metal, or ammonium salt, die 
concentration of solvent, the specific type of alkaline earth, alkali metal, or ammonium salt employed, the 
specific type of chaotropic agent employed, and the type of polypeptide, as well as the pH of the buffer, but in 

10 general will range from about 0.1 to 9 M, preferably about 0.5 to 6 M, and most preferably about 1.5 to 4 M. 
As to specific chaotropes, preferably about 0.1 to 2 M of guanidine hydrochloride, and preferably about 1-3 M, 
more preferably about 1-2.5 M, and most preferably about 2 M, of urea is utilized. 

A fourth key ingredient of the buffer is an effective amount of a transition metal salt selected from 
copper and manganese salts so that oxidation and resultant refolding will occur. The amount of copper or 

1 5 manganese salt depends mainly on the type of transition metal and polypeptide employed and the oxygen level 
present. The lower the rate of oxygen addition or the oxygen level, the higher the amount of copper or 
manganese salt that can be employed. The copper or manganese salt concentration is typically about 0.01 to 15 
^M, preferably about 0.01 to 10 ^M, more preferably about 0.01 to 5 \iM, and even more preferably about 0.01 
to 0.5 ^M. The above preferred ranges are particularly preferred for IGF-L If the concentration is increased 

20 beyond about 15 ^M, unexpectedly the yield of correctly folded polypeptide decreases dramatically. Most 
preferably, the concentration of a copper or manganese salt is about 0.5 |tM. The transition metal sah may 
already be present in the buffer without addition of exogenous transition metal salt, for example, if it is residual 
from the fermentation, or it may be added to the buffer, or both. 

The buffer can be any of those listed above for the first buffered solution, with CAPSO, glycine, and 

25 CAPS being preferred at pH 8.5- 1 1 , particularly at a concentration of about 20 mM, and most preferably CAPSO 
and glycine. The polypeptide may be diluted with the refolding buffer, preferably at least five fold, more 
preferably at least about ten fold. Alternatively, the polypeptide may be dialyzed against the refolding buffer. 
The refolding is typically carried out at about 0-45° C, preferably about 20-40" C, more preferably about 23- 
37" C, even more preferably about 25-37" C, and most preferably about 25" C for at least about one hour. The 

30 preferred temperature is not apparently affected by salt, solvent, and chaotropic agent levels, but may be affected 
by the presence of sucrose and glycerol, m which case it should be kept above about 20" C. The solution 
optionally also contains a reducing agent and an osmolyte. 

The reducing agent is suitably selected from those described above for the solubilizing step in the 
concentration range given. Its concentration will depend especially on the concentrations of alkaline earth, alkali 

35 metal, or ammonium salt, polypeptide, and solvent Preferably, the concentration of reducing agent is about 0.5 
to 8 mM, more preferably about 1-5 mM, even more preferably about 0.5-2 mM. The preferred reducing agents 
are DTT and cysteine. 
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The optional osmolyte is preferably sucrose (in a concentration of about 025-1 M) or glycerol (in a 
concentration of about I -4 N4). More preferably, the sucrose concentration is at about 1 M and the glycerol 
concentration is at about 4 M. 

The initial concentration of polypeptide in the folding buffer is such that the ratio of correctly folded . 

5 to misfolded conformer recovered will be maximized, as determmed by HPLC, RIA, or bioassay. The exact 
concentration will depend, for example, on the type of polypeptide employed. The preferred concentration of 
polypeptide (resulting in the maximum yield of correctly folded conformer) is in the range of about 0.1 to 15 
mg/mL, more preferably about 0.1 to 6 mg/mL, and most preferably about 0.2 to 5 mg/mL. 

In addition, a source of oxygen such as air or oxygen gas is entrained in or otherwise introduced into 

10 the buffer so as to effect oxidation together with the copper or manganese salt. The oxygen can be present in 
the buffer at any point in time, including before the polypeptide or any other reagents are added to the buffer. 

The amount of oxygen source introduced will depend, eg., on the type of vessel utilized, the type and 
concentration of polypeptide, the type of oxygen source, the type and amount of copper or manganese salt, and 
the type and amount of reducing agent present, if any, and the type and amount of chaotropic agent present as 

15 well as the pH of the buffer. Generally, the oxygen source will be introduced by passive means (e.g., as air in 
head space in a ratio of air space to fluid volume of 2:1) using an agitator. Alternatively, the oxygen source may 
be introduced by bubbling through a sparger. The rate of introduction of the oxygen must be sufficient to allow 
folding to reach completion in preferably about 1 to 12 hours, more preferably about 1 to 6 hours, and most 
preferably about 1 to 3 hours. The addition of molar oxygen is proportional to the reductant concentration and 

20 polypeptide concentration, but inversely proportional to the copper or magnesium salt concentration. The rate 
of oxidation is limited by the level of catalyst, not by the oxygen addition rate. A higher sparging rate is required 
for larger volume folding. 

The degree of refolding that occurs upon this second incubation is suitably determined by the RIA titer 
of the polypeptide or by HPLC analysis using e.g., a Vydac or Baker C-18 column, with increasing RIA titer or 

25 correctly folded polypeptide peak size directly correlating with increasing amounts of correct, biologically active 
polypeptide conformer present in the buffer. The incubation is carried out to maximize the yield of correctly 
folded polypeptide conformer and the ratio of correctly folded polypeptide conformer to misfolded polypeptide 
conformer recovered, as determined by RIA or HPLC, and to minimize the yield of multimeric, associated 
polypeptide as determined by mass balance. 

30 The following procedures are exemplary of suitable purification procedures: fractionation on 

immunoaffinity or ion-exchange columns; ethanol precipitation; reverse phase HPLC; chromatography on silica 
or on a cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; 
and gel filtration usmg, for example, Sephadex G-75. 

The following examples are offered by way of illustration and not by way of limitation. The disclosures 

35 of all patent and scientific references cited in the specification are expressly incorporated herein by reference. 

EXAMPLE I 

The dsbA nucleotide sequence was amplified using PGR techniques from pi 6-1 (Bardwell et aL, supra, 
1991) and aJCbal site was added at tiie 5* end and a Clal site at the 3* end of the DsbA-encoding sequence. This 
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DNA fragment was digested with the appropriate restriction enzymes and ligated with a 1.0-kb Fstl-Xbal 
fragment of pKMTacII (deBoer et a!., Proc. Natl. Acad. Sci. USA. Jfi: 21-25 [1983]) and a 3.6-kb Clal-Psil 
fragment of pBKIGF-2B (U.S. Pat. No. 5342,763). The resulting plasmid, pJJ4l (see Fig. 1), contains dsbA 
under tfie control of the racll promoter and confers resistance to ampicillin and tetracycline. This plasmid was 
5 digested with Cial and die ends were filled in by Klenow fragment and deoxynucleotides. 

After inactivation of the Klenow fragment and extraction of the digested DNA, the DNA was 
subsequently digested with Seal and the 1 .5-kb fragment containing the 5' end of the p -lactamase gene and the 
Tacll'dsbA element was purified. This was ligated with a 4.6-kb fragment of £coRI-digested and filled-in 
pBKIGF-2B, which had also been digested with Seal. This fragment of pBKIGF-2B contained the 3* end of the 

10 P-Iactamase, allowing selection for resistance to ampicillin as well as the DNA sequences for resistance to 
tetracycline and production of IGF-I. This plasmid is denoted pJJ42 (see Fig. 1). 

This plasmid was transfonned into K coli strain 33D3 (with the genotype W3 1 1 ptr3 laclq LacLS 
ompTdegP karf). This strain was constructed in several steps using techniques involving transduction with 
phage Pike, derived from PI (J. Miller, Experiments in Molecular Genetics. Cold Spring Harbor, NY, Cold 

1 5 Spring Harbor Laboratory, 1 972) and transposon genetics (Kleckner et ai , J. Mol. Biol.. 1 16 : 1 25- 1 59 [ 1 977]). 
The starting host used was E. coli K*12 W3 1 10, which is a K- 12 strain that is F- lambda- (Bachmann, Bact. Rev.. 

525-557 [1972]; Bachmann, "Derivations and Genotypes of Some Mutant Derivatives of Escherichia coli 
K-12,'* p. 1 190-1219, in F. Q Neidhardt et al,, ed., Escherichia coli and Salmonella tvphimurium: Cellular and 
Molecular Biology, vol. 2, American Society for Microbiology, Washington, D.C., 1987). 

20 Introduction of the tonA (JhuA) mutation and ptrS mutation into the genome is described in detail in U.S. 

Pat No. 5,304,472 issued April 19, 1994. The resulting strain is designated 9E4. The lacfl (Muller-Hill et a!., 
Proc. Natl. Acad. Sci. USA. 59: 1259 [1968]) and lacLS (Scaife and Beckwith, Cold Spring Harbor Svmp. 
Quant. Biol.. 31: 403 [1967]) mutations were introduced into strain 9E4 by co-transduction with proC. First, 
a linked Tn5 insertion into proC and a phoA A El 5 (Sarthy , A. et ai , L BacterioL . J45 : 28 8-292 [1981]) mutation 

25 were introduced. PI transduction to proline prototrophy restored the proC gene and iYit phoA gene and also 
introduced the lacfl lacL8 mutations. The lacfl mutation results in an overproduction of lac repressor compared 
to wild'type. The presence of the lacfl mutation can be observed on LB plates containing the chromogenic 
substrate 5-bromo-4-chloro-3-indolyl-P-D-galactoside (X-gal). On these plates a lact^ strain will be light blue, 
whereas a lacfl strain will be colorless. The LacLS mutation is a promoter mutation that results in lower lac 

30 operon enzyme levels. The presence of the lacL8 mutation can be detected on MacConkey 1% lactose medium 
as colonies with a dark red center and beige surrounding edge. 

Introduction of the ompr deletion and the degP mutation into the strain is described in detail in U.S. 
Pat. No. 5,304,472. 

The final strain 33D3 is resistant to Tl and 08O phage, lacks three proteases, overproduces the lac 
35 repressor, and produces lower than wild-type lac operon enzyme levels when induced. 

33D3 cells containing pJJ42 were then grown in a 1 0-liter fermentor under conditions identical to IGF-I 
production as described in U.S. Pat. No. 5,304,472, Example II, 1 x feed conditions. When the optical density 
(A55Q) of the culture reached 60, IPTG or lactose was added in the amounts given below to induce the tacW 
promoter, thereby increasing dsbA expression. The growth medium was designed for phosphate depletion and 
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phoA promoter induction to occur at an optical density (A^^q) of 80 to 100. Total IGF-I production was 
measured by a high-pressure liquid chromatography (HPLC) assay described by U.S. Pat No. 5304,472. 
The results are shown in Table I below: 



TABLE 1 



Fermentation Run 


Conditions 


g/L IGF-I 


S1968 


pBKIGF-2B, no inducer 


3.0 


S1963 


pJJ42, 0.05 mM IPTG 


4.8 


S1964 


pJJ42, 1.0 mM IPTG 


4.8 


S1959 


pJJ42, 1.0% lactose 


4.0 



10 The results show that over-expression of the dsbA gene in the periplasm of the bacteria improves the 

yield of IGF-I for all runs that were induced by either inducer. The increase in IGF-I accumulation was about 
60%. The correctly folded IGF-I can be recovered in high yield by liquid-liquid extraction followed by aerobic 
refolding as described above or by methods described in U.S. Pat. No. 5,288,93 1 . 

EXAMPLE II 

15 The dsbC nucleotide sequence was amplified using PGR techniques from the genome of E. coli 

(Missiakas et ai, supra, [1994]) and a Abal site was added at the 5* end and a Clal site at the 3* end of the DsbC- 
encoding sequence. This DNA fragment was digested with the appropriate restriction enzymes and ligated with 
a LO-kb Pst\-Xba\ fragment of pKMTacII (de Boer et aL Proc. Natl. Acad. Sci. USA. SQ: 21-25 [1983]) and 
a 3.6-kb ClaUPstl fragment of pBKIGF-2B (U.S. Pat. No. 5,342,763). The resulting plasmid. pJJ37 (see Fig. 

20 2), contains eisbC under the control of the tacll promoter and confers resistance to ampicillin and tetracycline. 
This plasmid was digested with Clal and the ends were filled in by Klenow fragment and deoxynucleotides. 

After inactivation of the Klenow fragment and extraction of the digested DNA, the DNA was 
subsequently digested with Seal and the 1.5-kb fragment containing the 5' end of the P-lactamase gene and the 
TacU-dsbC element was purified. This was ligated with a 4.6-kb fragment of £coRI-digested and filled-in 

25 pBKIGF-2B, which had also been digested with Scah This fragment of pBKIGF-2B contained the 3' end of the 
P-lactamase, allowing selection for resistance to ampicillin as well as the DNA sequences for resistance to 
tetracycline and production of IGF-I. This plasmid is denoted pJJ40 (see Fig. 2). 

This plasmid pJJ40 was transformed into E. coli strain 33D3, described above. 33D3 cells containing 
pJJ40 were then grown in a 10-liter fermentor under conditions identical to IGF-I production as described in U.S. 

30 Pat. No. 5,304,472, Example II, 1 x feed conditions. When the optical density (A55Q) of the culnu*e reached 
60, 1% lactose was added to induce the tacll promoter, thereby increasing dsbC expression. The growth medium 
was designed for phosphate depletion and phoA promoter induction to occur at an optical density (A^^q) of 80 
to 100. Total IGF-I production was measured by an HPLC assay described by U. S. Pat. No. 5,304,472. 

The results are that the experiment with 1% lactose produced 4 g/L IGF-I, versus the control, pBKIGF- 

35 2B, no inducer, shown in Table 1 above, which produced 3 g/L of IGF-I. The correctly folded IGF-I can be 
recovered in high yield by liquid-liquid extraction followed by aerobic refolding as described above. 
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EXAMPLE III 

The dsbA gene was cloned into a vector pBADlS (Fig. 3) containing a cloned inducible arabinose 
promoter (Guzman et al, supra) that over-expresses the gene in response to the presence of arabinose in the 
medium. The dsbA gene was amplified by PGR, which attached restriction sites at the ends for ease in cloning. 
S An EcoRl site was placed at the S' end and a KpnJ site at Ae 3' end. The vector into which dsbA was cloned was 
digested with the same enzymes and the dsbA PGR product was Hgated with the digested vector to form pBADJ2 
(Fig. 3). This plasmid pBAD12 was digested with Clal and ///ndlll and then made blunt-ended witii the use of 
Klenow polymerase fragment L A 1 .9-kb fragment encoding the arabinose promoter and dsbA was purified. 
This arabinose promoter with the dsbA gene was then cloned into the EcoRV site of the IGF-l production plasmid 

1 0 pBKJGF-2B as described above. The direction of transcription of dsbA was the same as for IGF-I. This plasmid 
was designated as pBKIGF2B-A. 

The plasmid pBKIGF2B-A was then transformed into £. coli strain 36F8 (with the genotype W31 10 
tonA phoA A(argF4ac) ptrS degP kai^ i7vG+, and being temperature resistant at 37° C). This strain was 
constructed in several steps using the same basic techniques described above. The starting host for the 

15 construction 27 A7, is described in detail in U.S. Pat. No. 5,304,472. 

Next, the degP4I kan'^ mutation was introduced into 27 A7. This mutation is described in U.S. Pal. No. 
5,304,472. The resulting strain is designated 27B4. The 27B4 strain grows poorly on LB media at 37° C and 
42** C. A temperature-resistant isolate was obtained at 37° C by picking a spontaneous colony that grew better. 
This strain is designated 3SE7. 

20 A ribose deletion was then introduced into strain 35E7 by PI co-transduction with a linked TnlO 

insertion m the ih gene. The isoleucine/valine auxotrophy was transduced to prototrophy using PI phage grown 
on a strain carrying the i?vG20P5^ mutation (Lawther et al., Proc. Natl. Acad. Sci. USA. 25: 922-925 [1981]), 

D 

which repairs a frameshift that causes the wild-type E, coli K-12 strain to be sensitive to valine. The ilv2096'^ 
locus was confumed by resistance of the 36F8 host to 40 ng/mL valine. The rbs deletion was also repaired 

25 during this step, resulting in a strain able to utilize ribose as a carbon source. Tne final strain 36F8 is resistant 
to Tl and 080 phage, lacks two proteases, does not overproduce alkaline phosphatase upon depletion of 
phosphate in the medium, grows well at 37° C, and is not susceptible to valine toxicity. 

36F8 cells containing the plasmid pBKIGF2B-A were then grown in a 10-liter fermentor under 
conditions identical to the IGF-I production described in U.S. Pat No. 5,304,472, supra, I x feed conditions, 

30 Example III. When the optical density (A55Q) of the culture reached 80, \% (w/v) L-arabinose was added. This 
induced dsbA expression and resulted in secretion of the DsbA protein into the periplasm. The growth medium 
was designed for phosphate depletion and phoA promoter induction to occur at an optical density (A^^q) of 100. 
Total IGF-I production was measured by the HPLC assay used in Example I. 
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The results are shown in Table 2 beiow: 

TABLE 2 



Fermentation Run 


Conditions 


L lor-i 


1 


pBKIGF2B-A, 1 .5% arabinose 


6.1 


2 


pBKIGF2B-A, 1.0% arabinose 


7.6 


3 


pBKIGF2B-A, 1.0% arabinose 


7.2 


4 


pBKIGF2B-A. 0.5% arabinose 


6.6 


5 


typical periplasmic IGF-I 
process 


3.7 



The results show that over-expression of the dsbA gene in the periplasm of the bacteria improves yield 
10 of total (soluble and insoluble) IGF-I for all runs involving the arabinose promoter and DsbA. This increase in 
IGF-I accumulation was approximately 2-fold using 1% L-arabinose. The IGF-I in correctly folded fonn can 
be recovered by liquid-liquid extraction and by refolding as described above, or by methods as described in U.S. 
Pat. No. 5^88,931. 

The yield increase was not seen for DsbB, as cells over-expressing dsbB from a very similar plasmid 
15 do not show the increase. Furthennore, if the dsbA DNA and IGF-I DNA were co-expressed using the same 
promoter, total IGF-I yields were decreased as compared to a control with no dsbA gene (pRKIGF-2B). 

EXAMPLE IV 

For further exploration of the potential advantages of DsbA over-expression, a host organism with a 
mutated phoS protein (as described in U.S. patent 5,304,472) was employed in the process. After the phosphate 

20 feed rate was adjusted, IGF-I polypeptide yields in excess of 9 g/L were obtained. 

In this study, the plasmids used were pBKIGF-2B (U.S. Pat. No. 5,342,763) and pJJIOI. Plasmid 
pJJlOl (Fig. 4) was constructed to allow the expression of dsbA from the arabinose promoter and to use 
kanamycin as the drug resistance marker for plasmid introduaion and retention. It was constructed as follows: 
A 973-bp DrallUBsmYl fragment from pACYC177 (Chang and Cohen, J. Bacteriolo^v. 124: 1 141-1 156 [19781) 

25 was iigated to a 5820-bp Mscl-Smal fragment from pBKIGF2B-A described above. The resulting plasmid 
pJJlOl is both kanamycin and ampicillin resistant. The kanamycin resistance is derived from the pACYC177 
fragment. The IGF-I gene, arabinose-reguiated dsbA element, and ampicillin resistance gene are derived from 
pBKIGF2B-A. 

Two different host organisms were used in this study, 45F8 and 46D5. The genotype of 45F8 is K coii 
30 K12 W3110 AtonA ptr3 A(argF-lac)I69 degP41(APsthkar?) AompT ilvG2096^ phoS*(TIOY), 46D5 is 
similar, but, in addition, contains the following loci, Acyoiikarf rhaR A(rbs7). RhaR and A(rbs7) are a point 
mutation and a deletion which render the organism unable to use liiamnose and ribose. These changes are 
invisible unless ribose or rhamnose are present. They have been shown to have no effect on the yield of IGF-I 
polypeptide. The Acyo::kar{ mutation removes one of the cytochrome oxidases, cytochrome o oxidase. 
35 However, the other, cytochrome d oxidase, is sufficient for normal metabolism and has been shown to have little 
or no effect on the yield of IGF-1 polypeptide. The details of the construction of these strains are provided 
below. 
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The host 46D5 is a derivative of £: coli K12 W3 1 10, described above. The complete genotype of 46D5 
is INfrrnD-rrnE)! AJhuA ptrS A(argF-lac)I69 AompT degP41(APsU'karf) phoS* (TIOY) cyo::kan^rhaR 
A(rbs7) ihG2096^. The derivation of 46D5, diagrammed in Table 3, is described below. 

The starting strain, E, coli W3 1 1 0, is a derivad ve of L coli K- 12 that is F~ and A~. It has bem shown 
5 to carry an inversion of the chromosome between rmD and rrnE, Bachmann ei al. , supra^ and Bachmann, siq>ra. 

The JhuA gene (previously designated tonA) (U.S. Pat No. 5304,472) was deleted from W31 10 by 
imprecise excision of TnW following its insertion into the JhuA gene. The resulting strain, 1 A2, is resistant to 
bacteriophages Tl, T5, and ^80. 

The ptr3 gene (U.S. Pat No. 5,304,472) was introduced into this strain by two steps. Two flanking 
10 markers, thyAS and argA8l::TnlO were transfeired into 1 A2. Transduction with PI phage grown on a ptr3 
mutant resulted in the introduction of the ptr3 mutation simultaneously with the conversion of thyAS and 
argA8J::TnW to wild-type loci. This strain, 9E4, lacks periplasmic protease III. 

Two deletion mutations, phoAAElS and A(argF'lac)l69 (U.S. Pat No. 5,304,472) were 
simultaneously introduced into W31 10 by PI cotransduction with a linked TnS insertion in the proC gene. PI 
15 transduction to proline prototrophy restored the proC gene. The phoAAElS mutation eliminates alkaline 
phosphatase expression, and the A(argF-lac)J69 mutation is responsible for the /ac-minus phenotype of this 
strain, 27 A7. 

The omp 7* deletion (U.S. Pat. No. 5,304,472) was introduced by PI cotransduction with a linked TnW 
insertion in the purE gene. Next, this strain was n^ansduced to purine prototiophy to remove the transposon. The 

20 presence of the ompF deletion was confirmed by colicin B and colicin D resistance in this strain, 27C6. This 
strain lacks the outer membrane protease VII which is encoded by the ompfgene. 

The degP41(APstI-kar{) (U.S. Pat No. 5,304,472) mutation in the gene for a periplasmic protease was 
introduced by transduction. This mutation was constructed in vitro by replacing a section of the degP gene with 
a kanamycin-resistance gene. This is not a transposon, but allows for selection of the deletion using kanamycin 

25 resistance. The resulting strain is 2 7C7. 

A TnlO insenion in the ilv gene was introduced by PI transduction. The isoleucine/valine auxotrophy 
was transduced to prototrophy using PI phage grown on a strain carrying the ilvG2096^ mutation (Lawther et 
al.. Proc. Natl. Acad. Sci. USA. 78: 922-925 [1981]). which repairs a frameshift that causes the wild-type E. coli 
IC-12 to be sensitive to valine. The ilvG2096^ locus was confirmed by the resistance of the 43D3 host to 40 

30 pg/mL valine (0.3mM). 

The degP4I(APstl-kar/) mutation was replaced using PI transduction with &degP41(APslJ-kaf^) 
mutation. A proAB::TnIO which is linked to degP was introduced into 43 D3. The proline auxotroph was 
transduced to prototrophy using PI phage grown on a strain carrying the degP karf mutation. Since degP is 
linked by cotransduction to JhuA, the new strain, 43E7, was con&med to retain resistance to bacteriophage Tl . 

35 The wild-type alkaline phosphatase gene was re-introduced into this host background to capitalize on 

the benefits obtained from the phoS* mutation described below. PI co-transduction of a TnS insertion in the 
proC gene with phoA-^ was used to reintroduce the wild-type alkaline phosphatase gene into this host 
background. PI transduction to proline prototrophy restored the proC gene. The resulting strain, 44D6, regained 
alkaline phosphatase expression and maintained the A(argF-lac) mutation responsible for the lac 'phenotype. 
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A mutation which results in an altered phosphate-binding protein, phoS* (TIOY) (U.S. Pat. No. 
5,304,472) was introduced. The protein has a reduced affinity for phosphate in the medium. The result 
is that induction of the alkaline phosphatase promoter occurs at a higher phosphate concentration than the wild- 
type. This allows product expression fh>m the APase promoter without severely depriving the culture of 
5 phosphate. PhoS is linked by PI cotransduction to ih. A TnIO insertion in the ih gene was re-introduced by 
PI transduction. The isoleucine/valine auxotrophy was transduced to prototrophy using PI phage grown on a 
strain carrying the phoS* (TIOY) and ihG2096^ mutations. The presence of the phoS* mutation results in blue 
colonies on high-phosphate agar media containing the chromogenic substrate, XP. The resulting strain was 45F8. 

The cyo. iharf (Oden et al , Gene. 2fi: 29-36 ( 1 990]) mutation in the gene for cytochrome o oxidase was 
10 introduced by transduction. This mutation was constructed in vitro by replacing a portion of the cyo gene with 
a kanamycin-resistance gene. Introduction of diis mutation prevents switching between the low- and high-affinity 
cytochrome oxidases, cyo and cyd, respectively. The cytochrome switdiing phenomenon can lead to dissolved 
oxygen instabilities and unsuccessful fermentation runs. The resulting strain was 4SF9. 

Finally, two mutations in carbohydrate utilization pathways were introduced. The rhaR (Moralejo et 
15 fl/., J. Bacteriol. . 175 : 5585-5594 [1993]) mutation was introduced by PI co-transduction with orgE, An 
argE::TnJO was introduced. The strain was restored to prototrophy using PI phage grown on a strain carrying 
the rhaR mutation. The resulting strain, 46F1, was confirmed to be unable to utilize rhamnose as a carbon 
source. 

The Arbs7 (Lopilato et ai., J. Bacteriol.. 154: 665-673 [1984]) mutation was introduced by PI 
20 cotransduction with a linked TnJO insertion in the ilv gene. The isoleucine/valine auxotrophy was transduced 
to prototrophy using PI phage on a strain carrying the iivG2096^ mutation. The presence of the ribose- 
utilization defect in the resulting strain, 46D5, was confirmed using minimal media containing rtbose as a carbon 
source. Retention of the phoS* (TJOY) was also confirmed as described above. 

The important characteristics of the fmal strain, 46D5, are that it is resistant to Tl and <I>80 phage, lacks 
25 three proteases, is not susceptible to valine toxicity, does not undergo cytochrome switching, and allows phoA 
promoter induction at higher phosphate concentrations. 

Table 3 indicates the strains and genotypes involved in the construction of 46D5. 
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Table 3 



Strain 


Genotype 


W3110 


K-12 P- IN(rmD-rrnE)l 


1A2 


W3110 AJhuA 


9E4 


W3110 AftiuAptrS 


27A7 


W3 1 10 AJhuA ptr3 phoAAElS A(argF4ac)169 


27C6 


W3 1 10 nJhuA ptr3 phoAnLJj n(argr-lacJJo9 £iompl 


27C7 


W31 10 £\jnuA ptrJ pnoA/iti5 £\(argr-tac)io9 nompi aegF4I(npstI^Kan ) 


43D3 


W31 10 ptr3 phoAAElS A(argF-lac)I69 AompT degP4l(APst}'karf) 
ivG209o 


43E7 


W3110 A(argF-lac)l69 AompT AJhuAptrS phoAAElS degP4l(APstl'karf) 


44D6 


W3110 idiyTiu/i ptr3 A(argF-lac)!69 degP4](Apstl''karf) AompT ilvG2096^ 


45F8 


W3 1 10 AfhuA pirS A(argF4ac)169 degP41(Apsti-karf) AompT phoS*(T10Y) 
ilvG2096^ 


45F9 


W31 10 AfliuA ptrS A(argF-lac)169 degP41(Apstl-katf) AompT iIvG2 096^ 
phoS*(TJOY) Acyo::karr 


46F1 


W3 1 10 AJhuA ptrS A(argF-lac)169 degP41(Apstl-karf) AompT ilvG2096^ 
phoS*(T10Y) Acyo::karf rhaR 


46D5 


W3 1 1 0 AflluA ptrB A(argF'lac)169 degP41(Apstl'karf) AompT ilvG2096^ 

phoS^mOY) Acvor.kaf/ rhaR A(rbs7) 



The fermentations were conducted essentially as described in U.S. Pat No. 5,304,472, Example III, Ix 
feed conditions. However, in all cases the complex nitrogen feed was started at the beginning of the fermentation 
and no isoleucine was added to the medium. In most cases, a 70% wt/vol glucose feed was used, but for SI 1339 
and SI134I, a 85% wt/vol glucose feed was employed. In all cases with the plasmid pJJlOl, DsbA expression 
20 was induced by adding arabinose to the culture to produce a 1% concentration at a cell density of 80 OD (A^^q). 
No induction was done for the control case (run SIl 3 12). Rim SII341 was conducted at a higher agitation rate 
(1 100 to 1200 rpm) as compared to the other fermentations, which were all conducted at 1000 rpm. Table 4 
provides a summary of the experiments as well as peak titers. The time course of IGF-I polypeptide 
accumulation for each is shown in Figure 5. 
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Table 4 



Run No. 


Organism 


Complex N Feed Rates 
(mL/min) 

Initial After 
120 op 


RPM 


Glucose 
Feed 
(wt%) 


Peak 
Titers 
(g/L) 


SI1312 


46D5/pBKIGF-2B 


0.5 


0.25 


1000 


70 


6.4 


SI 1306 


45F8/pJJ101 


0.5 


0.25 


1000 


70 


7.8 


SI1329 


45F8/pJJ101 


0.5 


OJ 


1000 


70 


9.0 


811339 


45F8/pJJ101 


0.6 


0.6 


1000 


85 


9.6 


SI1341 


45F8/PJJ101 


0.6 


0.6 


1100-1200 


85 


9.3 



In all cases with dsbA over-expression, the rate of IGF-I polypeptide accumulation and the fmal titer 
are significantly improved relative to the case without dsbA overexpression. Furthermore, in comparison to the 
10 results in Example III, significantly higher titers are obtained when a host with a phoS mutation and an active 
phoA gene is combined with dsbA overexpression for the production of IGF-I polypeptide. 
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WHAT IS CLAIMED IS: 

1 . A process for producing a heterologous polypeptide in bacteria comprising: 

(a) culturing bacterial cells that lack their native pstS gene and comprise nucleic acid encoding 
a PstS variant having an amino acid variation within the phosphate-binding region of the corresponding 

5 native PsiS, nucleic acid encoding a DsbA or DsbC protein, nucleic acid encoding the heterologous 

polypeptide, a signal sequence for secretion of both the DsbA or DsbC protein and the heterologous 
polypeptide, an inducible promoter for the nucleic acid encoding the DsbA or DsbC protein, and an 
alkaline phosphatase promoter for the nucleic acid encoding die heterologous polypeptide, wherein the 
nucleic acid encoding a PstS variant is under the transcriptional control of the wild-type pstS gene 

10 promoter, and wherein the culturing takes place under conditions whereby expression of the nucleic 

acid encoding the DsbA or DsbC protein is induced prior to induction of the expression of the nucleic 
acid encoding the heterologous polypeptide, under conditions whereby the culturing takes place in a 
culture medium at a concentration of inoiganic phosphate in the medium that during all phases of cell 
growth is above the level at which the cells are starved for phosphate, and under conditions whereby 

1 5 either both the heterologous polypeptide and the DsbA or DsbC protein are secreted into the periplasm 

of the bacteria or the heterologous polypeptide is secreted into the medium in which the bacterial cells 
are cultured; and 

(b) recovering the heterologous polypeptide from the periplasm or the culture medium. 

2. The process of claim 1 wherein the heterologous polypeptide is a mammalian polypeptide. 
20 3. The process of claim 2 wherein the mammalian polypeptide is a human polypeptide. 

4. The process of claim 3 wherem the human polypeptide is insulin-like growth factor-I. 

5. The method of claim 1 wherein the concentration of inorganic phosphate at all phases of cell 
growth is more than about 0.5 mM and wherein the pstS variant is homologous to the native pstS gene in the host 
cells. 

25 6. The process of claim 1 wherein the polypeptide is recovered from the periplasm of the bacterial 

cells. 

7. The method of claim 1 wherein the bacterial cells are Enterobacteriaceae cells. 

8. The method of claim 1 wherein the amino acid variation in the nucleic acid molecule is an 
amino acid substitution. 

30 9. The method of claim 8 wherein the bacterial cells are E. coli and a hydrophobic residue is 

substituted for threonine at position 10 of the phosphate-binding region of native E, coli PstS. 

10. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes TIOF PstS 
under the transcriptional control of the wild-type E. colipstS gene promoter. 

1 1 . The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes Tl OL PstS 
35 under the transcriptional control of the wild-type E, colipstS gene promoter. 

1 2. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes Tl OM PstS 
under the transcriptional control of the wild-type E. colipstS gene promoter. 
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13. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes TlOY PstS 
under the transcriptional control of the wild-type E. colipstS gene promoter. 

14. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes TlOA PstS 
under the transcriptional control of the wild-type £. eolipstS gene promoter. 

S 15. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes TIOC PsiS 

under the transcriptional control of the wild-type £. colipstS gene promoter. 

16. The method of claim 8 wherem the nucleic acid encoding a PstS variant encodes TlOG Ps^ 
under the transcriptional control of the wild-type E. colipstS gene promoter. 

17. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes S38F PstS 
1 0 under the transcriptional control of the wild-type K coii pstS gene promoter. 

1 8. The method of claim 8 wherein die nucleic acid encoding a PstS variant encodes D56V PstS 
under the transcriptional control of the wild-type K colipstS gene promoter. 

1 9. The method of claim 8 wherein die nucleic acid encoding a PstS variant encodes D56A PstS 
under the transcriptional control of the wild-type E. colipstS gene promoter. 

1 5 20. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes D56L PstS 

under the transcriptional control of the wild-type E. colipstS gene promoter. 

2 1 . The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes D56S PstS 
under the transcriptional control of the wild-type E colipstS gene promoter. 

22. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes S 139T PstS 
20 under the transcriptional control of the wild-type £ colipstS gene promoter. 

23. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes S 1 39P PstS 
under the transcriptional control of the wild-type E. colipstS gene promoter. 

24. The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes S 139L PstS 
under the transcriptional control of the wild-type E. colipstS gene promoter. 

25 25 . The method of claim 8 wherein the nucleic acid encoding a PstS variant encodes T 1 4 1 H PstS 

under the transcriptional control of the wild-type £. colipstS gene promoter. 

26. The process of claim I wherein the nucleic acid encoding the PstS variant is integrated into 
the chromosome thereof 

27. The process of claim I wherein nucleic acid encoding DsbA is expressed. 

30 28. The process of claim I wherein after the polypeptide is recovered, the DsbA or DsbC protein 

is separated from the polypeptide. 

29. The process of claim 1 wherein the bacterial cells are transformed with one or two expression 
vectors containing the nucleic acid encoding the DsbA or DsbC protein and the nucleic acid encoding the 
heterologous polypeptide. 

35 30. The process of claun 29 wherein the bacterial cells are transformed with two vectors 

respectively containing the nucleic acid encoding the DsbA or DsbC protein and the nucleic acid encoding the 
heterologous polypeptide. 
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3 1 . The process of claim 29 wrherein the nucleic acid encoding the DsbA or DsbC protein and the 
nucleic acid encoding the heterologous polypeptide are contained on one vector with which the bacterial cells 
are transformed. 

32. The process of claim 1 wherein the induction of expression of the nucleic acid encoding DsbA 
or DsbC is carried out by adding an inducer to tiie culture medium. 
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